Global 21cm signal experiments: A designer's guide 
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The global [i.e. spatially averaged) spectrum of the redshifted 21cm line has generated much 
experimental interest lately, thanks to its potential to be a direct probe of the Epoch of Reionization 
and the Dark Ages, during which the first luminous objects formed. Since the cosmological signal 
in question has a purely spectral signature, most experiments that have been built, designed, or 
proposed have essentially no angular sensitivity. This can be problematic because with only spec- 
tral information, the expected global 21cm signal can be difficult to distinguish from foreground 
contaminants such as Galactic synchrotron radiation, since both are spectrally smooth and the lat- 
ter is many orders of magnitude brighter. In this paper, we establish a systematic mathematical 
framework for global signal data analysis. The framework removes foregrounds in an optimal man- 
ner, complementing spectra with angular information. We use our formalism to explore various 
experimental design trade-offs, and find that 1) with spectral-only methods, it is mathematically 
impossible to mitigate errors that arise from uncertainties in one's foreground model; 2) foreground 
contamination can be significantly reduced for experiments with fine angular resolution; 3) most of 
the statistical significance in a positive detection during the Dark Ages comes from a characteristic 
high-redshift trough in the 21 cm brightness temperature; 4) Measurement errors decrease more 
rapidly with integration time for instruments with fine angular resolution; and 5) Better foreground 
models can help reduce errors, but once a modeling accuracy of a few percent is reached, significant 
improvements in accuracy will be required to further improve the measurements. We show that if 
observations and data analysis algorithms are optimized based on these findings, an instrument with 
a 5° wide beam can achieve highly significant detections (greater than 5cr) of even extended (high 
Az) reionization scenarios after integrating for 500 hrs. This is in strong contrast to instruments 
without angular resolution, which cannot detect gradual reionization. Ionization histories that are 
more abrupt can be detected with our fiducial instrument at the level of lO's to lOO's of a. The ex- 
pected errors are similarly low during the Dark Ages, and can yield a 25cr detection of the expected 
cosmological signal after only 100 hrs of integration. 

PACS numbers: 95.75.-z,98.80.-k,95.75.Pq,98.80.Es 



I. INTRODUCTION 

Measurements of the highly redshifted 21 cm line are 
thought to be the primary way to make direct obser- 
vations of the epoch of reionization and the preceding 
dark ages, when the first luminous objects were formed 
from primordial fluctuations [l] . Theoretical studies have 
shown that observations of the 21cm line will not only 
provide crucial constraints on a relatively poorly under- 
stood period of structure formation, during which a com- 
plex interplay of dark matter physics and baryon astro- 
physics produced large scale changes in the intergalactic 
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medium [2]-[4]; eventually, the enormous volume probed 
by the 21 cm line will also allow one to make exquisite 
measurements of fundamental physics parameters [SHE]- 
It is thus no surprise that numerous experimental groups 
are making concerted efforts to arrive at the first positive 
detection of the cosmological 21 cm signal. 

To date, most observational efforts have focused on 
understanding the fluctuations in the 21 cm line by mea- 
suring the 21 cm brightness temperature power spectrum 
(although there have also been theoretical proposals to 
capture the non-Gaussianity of the signal [9-12^). These 
include the Murchison Widefield Array (MWA) [13], the 
Precision Array for Probing the Epoch of Reionization 
(PAPER) [14 , the Low Frequency Array (LOFAR) [IS], 
and the Giant Metrewave Radio Telescope Epoch of 
Reionization (GMRT-EoR) [TO projects. These exper- 
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iments are difficult: high sensitivity requirements dictate 
long integration times to reach the expected amplitude 
of the faint cosmological signal, which is dominated by 
strong sources of foreground emission (such as Galactic 
synchrotron radiation) by several orders of magnitude in 
all parts of the sky [[TTHTQ] . While lately these experi- 
ments have made much progress towards a detection of 
the cosmological signal, the challenges remain daunting. 

As an alternative way to detect the 21cm cosmologi- 
cal signal, there have recently been attempts to measure 
the global 21 cm signal, where one averages the signal 
over all directions in the sky and focuses on extracting a 
globally averaged spectrum, probing the evolution of the 
mean signal through cosmic history [20 . These observa- 
tions are complementary to the power spectrum measure- 
ments, and have been shown to be an incisive probe of 
many physical parameters during reionization [2j. Ex- 
amples of global signal experiments include the Experi- 
ment to Detect the Global EoR Signature (EDGES) [22] . 
the Large aperture Experiment to detect the Dark Ages 
(LEDA) [23 , the Long Wavelength Array (LWA) [24], 
and if funded, the Dark Ages Radio Explorer (DARE) 

Compared to power spectrum measurements, global 
signal experiments are in some ways easier, and in other 
ways more difficult [26] . A rough calculation of the ther- 
mal noise reveals that global signal experiments need far 
less integration time to reach the required sensitivities 
for a detection of the cosmological signal. However, the 
problem of foreground mitigation is much more challeng- 
ing. In a power spectrum measurement, one probes de- 
tailed maps of brightness temperature fluctuations in all 
three dimensions. The maps are particularly detailed in 
the line-of-sight direction, since for a spectral line like 
the 21cm line this translates to the frequency spectrum 
of one's measurement, and typical instruments have ex- 
tremely fine spectral resolution. The result is that the 
cosmological signal fiuctuates extremely rapidly with fre- 
quency, since one is probing local structure. In con- 
trast, the foregrounds are spectrally smooth. This dif- 
ference has formed the basis of most foreground subtrac- 
tion schemes that have been proposed for power spectrum 
measurements, and theoretical calculations have been en- 
couraging [27-36 . Global signal experiments cannot eas- 
ily take advantage of this difference in fiuct nation scale, 
for the globally averaged spectrum does not trace local 
structures, but instead probes the average evolution with 
redshift. The resulting signals are thus expected to be 
rather smooth functions of frequency, which makes them 
difficult to separate from the smooth foregrounds. Tradi- 
tionally, experiments have attempted to perform spectral 
separations anyway, and have thus been limited to ruling 
out sharp spectral signatures such as those that might 
arise from rapid reionization [22 . 

In this paper, we confront the general problem of ex- 
tracting the global 21 cm signal from measurements that 
are contaminated by instrumental noise and foregrounds, 
placing the problem in a systematic mathematical frame- 



work. We adopt the philosophy that one should take 
advantage of every possible difference between the cos- 
mological signal and contaminants, not just the spec- 
tral differences. To do so, we first develop optimal data 
analysis methods for cosmological signal estimation, us- 
ing angular information to complement spectral informa- 
tion. Traditional spectral-only methods are a special case 
in our formalism, and we prove that in such methods, 
foreground subtraction simply amounts to subtracting a 
"best-guess" foreground model from the data and doing 
nothing to mitigate possible errors in the model itself. 

Additionally, we build on existing foreground models 
in the literature, and use our improved model along with 
our data analysis formalism to predict expected mea- 
surement errors given various experimental parameters. 
Since our formalism allows these measurement errors to 
be calculated analytically (given a foreground model), we 
are able to explore parameter space efficiently to identify 
the best experimental designs and the best data analysis 
methods. We find that the most important such "lesson 
learned" is that angular information is necessary to sup- 
press foregrounds to a low enough level to differentiate 
between different cosmological models. 

Our paper therefore has two logically separate (but 
closely related) goals. The first is to develop a robust 
signal extraction formalism, and we fulfill this goal with- 
out assuming a precise model for the cosmological signal, 
thus immunizing ourselves to uncertainties in theoretical 
modeling. The second is to build intuition for instrumen- 
tal design and to forecast the performance of different 
types of instruments. This part does assume a theoreti- 
cal form for the cosmological signal. 

In this paper our focus is on foreground removal, so for 
simplicity we do not consider calibration errors. How- 
ever, we do note that an accurate bandpass calibration 
(or at least a good estimate of the calibration error) is 
a prerequisite for accurate foreground removal. It is for 
this reason that much effort has been spent on better 
instrument characterization recently [37 . We also do 
not include frequency-dependent beam effects, although 
the formalism of this paper is general enough to include 
them. Both calibration errors and frequency-dependent 
beams are inevitably instrument-specific issues, so we de- 
fer detailed investigations of them to future work. 

The rest of this paper is organized as follows. In Sec- 
tion [H] we detail the properties of the cosmological signal 
and our foreground and instrumental noise model. Read- 
ers that are more interested in data analysis methods may 
wish to skim this section (perhaps reading in detail only 
the summary in Section II B 4 ), and skip ahead to Section 
IIIIl where we establish our mathematical framework for 



signal extraction and error computation. Those that are 
interested in experimental design may wish to focus on 
Sections IV and |V| where we explore various experimen- 
tal trade-offs for observations targeting the dark ages and 
reionization respectively. Forecasts for the performances 
of some fiducial experiments are presented in Section [Vl| 
and we summarize our conclusions in Section (VlII 
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II. THE COSMOLOGICAL GLOBAL 

SPECTRUM AND SOURCES OF 
MEASUREMENT CONTAMINATION 

A. Model of the signal 

We begin by sketching the ingredients of our 21 cm 
global signal model. For more details there exist several 
good reviews on the physics of the 21 cm signal [2 - 4 . The 
21 cm signal arises from the physical properties of neutral 
hydrogen in the intergalactic medium (IGM), specifically 
from the density of neutral hydrogen and the 21 cm spin 
temperature, which describes the relative number of hy- 
drogen atoms with proton and electron spin aligned or 
anti-aligned. These quantities respond to radiation from 
luminous sources and so are expected to vary from place 
to place. 

Fluctuations in the 21 cm signal are being targeted by 
radio interferometers such as LOFAR, MWA, PAPER, 
and GMRT, as mentioned in Section [l| These fiuctuation 
are scale dependent with most power on the characteristic 
scale of ionized or heated regions, which are believed to 
be tens of arc-minutes across. When viewed with a beam 
larger than this characteristic size these fiuctuations will 
average out, giving a measure of the mean or "global" 
21 cm signal. In this paper, we concentrate on measur- 
ing this isotropic part of the 21cm signal and consider 
the anisotropic fiuctuations as a source of noise. Stud- 
ies by [38] showed that during reionization a beam size 
of a few degrees is sufiicient to smooth out most of the 
fiuctuations. In this paper, any contribution of the fiuc- 
tuations left over after convolving with the large beam 
will be considered as irreducible noise. 

The basic dependence of the differential 21cm bright- 
ness temperature T5 on the average ionized fraction Xi 
and spin temperature Ts is 

n ^ 27(1 - X.) -J^"^ ] ( '4^] mK, (1) 



Ts J 
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where we have used the WMAP7 cosmological parame- 
ters to fix the baryon and mass abundances as Q^h'^ = 
0.023 and Qrah'^ = 0.15 [39 . The key redshift depen- 
dence comes via Xi{z) and Ts{z), which we model fol- 
lowing the approach of [40], where the reader will find 
technical details. This model incorporates (1) the ioniz- 
ing radiation from galaxies, (2) X-ray heating from galax- 
ies [41], (3) Lyman-alpha emission from galaxies, and as- 
sumes a simple prescription linking the star formation 
rate to the fraction of mass in collapsed structure above 
a critical mass threshold required for atomic hydrogen 
cooling. 

This model predicts a 21 cm signal that divides into 
three qualitatively different regimes (see Figure [T]). The 
first, a shallow absorption feature at 30 ^ 2; ^ 200 be- 
gins as the gas thermally decouples from the CMB and 
ends as our Universe become too rarified for collisions to 
couple Ts to Tgas- Next, a second and possibly deeper 



absorption feature occurs as the first galaxies form at 
z ^ 30. This is initiated as Lyman alpha photons illu- 
minate the Universe, coupling spin and gas temperatures 
strongly, and ends as increasing X-ray emission heats the 
IGM above the CMB, leading to a 21 cm emission signal. 
This emission signal is the third key feature, which slowly 
dies away in a "reionization step" as ionizing UV photons 
ionize the IGM. As described in [21, 42j there is consid- 
erable uncertainty in the exact positions and details of 
these features, but the basic picture seems robust. 

The last two features — an absorption trough driven by 
the onset of galaxy formatior|^ and an emission step ac- 
companying reionization — form the focus of this paper, 
since these seem most likely to be detectable (a fa ct tha t 
we will explain and rigorously justify in Section IV B). 



The earliest absorption trough seems unlikely to be de- 
tected in the near future, since it occurs at frequencies 
u < 50 MHz that are more strongly affected by the 
Earth's ionosphere and where galactic foregrounds are 
considerably brighter. 

It is important to note that the data analysis methods 
presented later in this paper do not depend on the precise 
form of the cosmological signal. This differs from other 
studies in the literature (such as [43]), which assume a 
parametric form for the signal and fit for parameters. In 
contrast, we use the theoretical signal shown in Figure [T] 
only to assess detection significance and to identify trade- 
offs in instrumental design; our foreground subtraction 
and signal extraction algorithms will be applicable even 
if the true signal differs from our theoretical expectations, 
making our approach quite a robust one. 



B. Generalized noise model 

We now construct our noise model. We define the 
generalized noise (or henceforth just "noise") to be any 
contribution to the measurement that is not the global 
21cm signal as described in the previous section. As 
mentioned above, by this definition our noise contains 
more than just instrumental noise and foregrounds. It 
also includes the anisotropic portion of the cosmological 
signal. In other words, the "signal" in a tomographic 
measurement (where one measures angular anisotropics 
on various scales) is an unwanted contaminant in our 
case, since we seek to measure the global signal {i.e. the 
monopole). 

If we imagine performing an experiment that images 
A/'pix pixels on the sky over TVfreq frequency channels, the 
noise contribution in various pixels at various frequencies 
can be grouped into a vector n of length equal to the 
number of voxels A^vox = ^pix^freq in our survey. It is 



^ For linguistic convenience, we will include the absorption trough 
at z ~ 20 as part of the Dark Ages, even though it really marks 
the very end of the Dark Ages. 
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FIG. 1. Target 21 cm global signal as predicted by the model of 21 . The exact details of this signal are uncertain and depend 
upon the nature of the first galaxies. 



comprised of three contaminants: 



1. Foreground Model 



n ' 



(2) 



where n^^, n^"®*, and signify the foregrounds, instru- 
mental noise, and anisotropic cosmological signal, respec- 
tively. Throughout this paper, we use Greek indices to 
signify the radial/frequency direction, and Latin indices 
to signify the spatial directions. Note that n is formally 
a vector even though we assign separate spatial and spec- 
tral indices to it for clarity. In the following subsections 
we discuss each of these three contributions to the noise, 
with an eye towards how each can be mitigated or re- 
moved in a real measurement. We will construct de- 
tailed models containing parameters that are mostly con- 
strained empirically. However, since these constraints are 
often somewhat uncertain, we will vary many of them 
as we explore parameter space in Sections |IV| and [V[ 
Our conclusions should therefore be robust to reasonable 
changes in our assumptions. 

Finally, we stress that in what follows, our models 
are comprised of two conceptually separate — but closely 
related — pieces. To understand this, note that Equation 
([2| is a random vector, both because the instrumental 
noise is sourced by random thermal fluctuations and be- 
cause the foregrounds and the cosmological signal have 
modeling uncertainties associated with them. Thus, to 
fully describe the behavior of n, we need to specify two 
pieces of information: a mean (our "best guess" of what 
the foregrounds and other noise sources look like as a 
function of frequency and angle) and a covariance (which 
quantifies the uncertainty and correlations in our best 
guess). We will return to this point in Section II B 4 when 
we summarize the essential features of our model. Read- 
ers may wish to skip directly to that section if they are 
more interested in the "designer's guide" portion of the 
paper than the mathematical details of our generalized 
noise model. 



Given that foregrounds are likely to be the largest con- 
taminant in a measurement of the global signal, it is im- 
portant to have a foreground model that is an accurate 
reflection of the actual contamination faced by an exper- 
iment, as a function of both angle and frequency. Having 
such a model that describes the particular realization of 
foregrounds contaminating a certain measurement is cru- 
cial for optimizing the foreground removal process, as 



we shall see in Section [Hlj However, constructing such a 
model is difficult to do from first principles, and is much 
more difficult than what is typically done, which is to 
capture only the statistical behavior of the foregrounds 
{e.g. by measuring quantities such as the spatial average 
of a spectral index). It is thus likely that a full fore- 
ground model will have to be based at least partially on 
empirical data. 

Unfortunately, the community currently lacks full-sky, 
low noise, high angular resolution survey data in the low 
frequency regime relevant to global signal experiments. 
Foreground models must therefore be constructed via in- 
terpolations and extrapolations from measurements that 
are incomplete both spatially and spectrally. One such 
effort is the Global Sky Model (GSM) of [19 . In that 
study, the authors obtained foreground survey data at 11 
different frequencies, and formed a series of foreground 
maps, stored in the vector g. The maps were then used 
to define a spectral covariance matrix G: 



iGSM 
'a/3 



1 ^ 

N ^ 



(3) 



where N is the number of pixels in a spectrally well- 
sampled region of the sky, and in accordance with our 
previous notation, g^i denotes the measured foregrounds 
in the i^^ pixel at the a^^ frequency channel. From this 
covariance, a dimensionless frequency correlation matrix 
was formed: 



nGSM 



r^GSMr^GSM 



(4) 
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By performing an eigenvalue decomposition of G into its 
principal components, the authors found that the spec- 
tral features of the foregrounds were dominated by the 
first three principal components, which could be used 
as spectral templates for constructing empirically-based 
foreground models. The GSM approach was found to be 
accurate to ~ 10%. 

Being relatively quick and accurate, the GSM has been 
used as a fiducial foreground model in many studies of 
the global 21cm signal to date [211 US]- However, this 
may be insufficient for two reasons. First, as mentioned 
above, the GSM approach predicts the magnitude of fore- 
grounds by forming various linear combinations of three 
principal component {i.e. spectral eigenmode) templates. 
Thus, if the GSM is considered the "true" foreground 
contribution in our models, it becomes formally possible 
to achieve perfect foreground removal simply by project- 
ing out just three spectral modes from the data. This 
is too optimistic an assumptiorj^ The other weakness of 
the GSM is that it does not include bright point sources, 
which are expected to be quite numerous at the sensitiv- 
ities of most 21cm experiments. 

In this paper, we use the GSM as a starting point, but 
add our own spectral extensions to deal with the afore- 
mentioned shortcomings. The extensions come partly 
from the phenomenological model of [44 , which in our 
notation can be summarized by writing down a matrix 
G®^* that is analogous to G^^^ defined above: 

Qext ^ Qps ^ Qsync ^ qS^ 

where G^^, G^^"^, and G^ refer to foreground contribu- 
tions from unresolved extragalactic point sources. Galac- 
tic synchrotron radiation, and Galactic free- free emission, 
respectively. Each contribution takes the generic form 

G,, = A'{^) (6) 

where A = 335.4 K, a = 2.8, and Aa = 0.1 for the 
synchrotron contribution; A = 70.8 K, a = 2.5, and 
Aa = 0.5 for the unresolved point source contribution; 
A = 33.5 K, a = 2.15, and Aa = 0.01 for the free-free 
contribution, and is a reference frequency, which we 
take to be 150 MHz. Our strategy is to perform a princi- 
pal component decomposition on this model, and to use 
its higher order principal components to complement the 
three components provided by the GSM, completing the 
basis. 



^ The reader should thus be cautious when comparing our predic- 
tions to forecasts in the hterature that use the GSM as their only 
foreground model. For identical experimental parameters, one 
should expect our work to give larger error bars. However, our 
new signal extraction algorithms (particularly those that make 
use of angular information) more than overcome this handicap, 
and our conclusions will in fact be quite optimistic. 




Eigenvalue number i 



FIG. 2. Comparison with the global sky model [19]. Our 
eigenvalues are represented by the blue circles, while those of 
the GSM are denoted by the purple squares. The GSM con- 
tains only three eigenvalues, which are in reasonable agree- 
ment with our model. 

We can check that our strategy is a sensible one by 
forming G [Equation Q] for both the GSM and our 
phenomenological model. For this test, we form the ma- 
trices for a set of observations over 70 frequency chan- 
nels, each with bandwidth 1 MHz spanning the 30 MHz to 
100 MHz frequency range (relevant to observations prob- 
ing the first luminous sources). We then compute the 
eigenvalue spectrum of both models, and the result is 
shown in Figure [2] The GSM, built from three principal 
components, contains just three eigenvalues. The first 
three eigenvalues of the phenomenological model agree 
with these values quite well, with the phenomenological 
model slightly more conservative. It is thus reasonable to 
use the higher eigenmodes of the phenomenological model 
to "complete" the foreground model spectrally. We ex- 
pect this completion to be necessary because there are 
more than three eigenvalues above the thermal noise limit 
for a typical experiment [44 . Spatially, we form an angu- 
lar model by averaging the GSM maps over all frequen- 
cies, and give all of the higher spectral eigenmodes this 
averaged angular dependence. While in general we ex- 
pect the spatial structure to be different from eigenmode 
to eigenmode, our simple assumption is a conservative 
one, since any additional variations in spatial structure 
provide extra information which can be used to aid fore- 
ground subtraction. 

Next, we add bright point sources to our model. The 
brightnesses of these sources are distributed according to 
the source count function 

dfl [ S \ — 1-75 

^ = (4sourcesmJy-^sr-i)(^g^^j ' 
and we give each source a spectrum of 

where the 5*0 is the brightness drawn from the source 
count function, and a is the spectral index, drawn from 
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a Gaussian distribution with mean 0.5 and a standard 
deviation of 0.25 [30 . Spatially, we distribute the bright 
point sources randomly across the sky. This is strictly 
speaking an unrealistic assumption, since point sources 
are known to be clustered. However, a uniform distri- 
bution suffices for our purposes because it is a conserva- 
tive assumption — any spatial clustering would constitute 
an additional foreground signature to aid foreground re- 
moval. 

Putting all these ingredients together, the result is a 
series of foreground maps like that shown in Figure [3j one 
at every frequency. This model constitutes a set of "best 
guess" foreground templates. Now, while our model is 
based on empirical data, it does possess uncertainties. A 
robust method for foreground subtraction thus needs to 
be able to account for possible errors in the generalized 
noise model. To set the stage for this, we define our 
best guess model described above to be the mean (amidst 
uncertainties in the construction of the model) of the n^^ 
vector in Equation ([2|, i.e. 



m ' 



(n 



(9) 



where m^^ is our best guess model. The covariance of 
the foregrounds is defined as the error in our foreground 
model: 



(10) 

where we have assumed that the error in our foreground 
model is proportional to the model m^^ itself, with a 
proportionality constant e between and 1. The matrices 
R and Q encode the spatial and spectral correlations in 
our foreground model errors, respectively. 

With this form, we are essentially assuming that there 
is some constant percentage error in every pixel and fre- 
quency of our model. Note that the model error e will in 
general depend on the angular resolution, for if we start 
with a high resolution foreground model and spatially 
downsample it to the resolution of our experiment, some 
of the errors in the original model will cancel out. As 
a simple, crude model, we can suppose that the errors 
in different pixels average down as the square root of the 
number of pixels. Since the number of pixels scales as the 
inverse of where is some generic angular resolution, 
we have 



paper. Our foreground covariance thus becomes 

Nl%, = el ' ra%ra%IL,,Q^,. (12) 

Since the angular structure of our foreground model is 
based on that of the GSM, which conservatively has a 
10% accuracy at 5° [19 , we use eo =0.1 and Ofg = 5° as 
fiducial values for this paper. 

To capture spatial correlation^ in our foreground 
modeling errors, we choose the matrix R to correspond 
to the continuous kernel 



R(r,r') 



47r exp(cr ^r-f') 
A^pix 47rcr2 sinh(cr-2) " 



(13) 



Aside from the constant 47r/A/pix factor (which is needed 
to make the discrete and continuous descriptions consis- 
tent [45]|46]), this is known as a Fisher function, the ana- 
log of a Gaussian on a sphere. The quantity a measures 
the spread of the kernel, and since the GSM's spectral 
fits were performed on pixels of roughly 5° resolution, we 
set (J = 5° for this worl^ 

For the spectral correlation matrix Q, suppose we 
imagine that our foreground model was constructed by 
spectrally fitting every pixel of a foreground survey to a 
power law of the form 



(14) 



where A is a normalization constant that will later can- 
cel out, a is a spectral index, and z/* is a reference fre- 
quency for the fits, which we take to be 150 MHz for the 
30 to 100 MHz observations targeting the first luminous 
sources, and 50 MHz for the 100 to 250 MHz observations 
targeting reionization. (The reference frequency is some- 
what arbitrary, and in practice one would simply adjust 
it to get the best possible fits when constructing one's 
foreground model). The spectral index will have some 
error associated with it, due in part to uncertainties in 
the foreground survey and in part to the fact that fore- 
ground spectra are not perfect power laws. We model this 
error as being Gaussian distributed, so that the proba- 
bility distribution of spectral indices is given by 



1 



exp 



1 (a 
2 



(15) 



^0- 



(11) 



where 6^5 is the angular resolution of our current experi- 
ment, 6>fg is the "native" angular resolution of our fore- 
ground model, and Sq is the fractional error in our model 
at this resolution. Equation ( pTj ) implicitly assumes a 
separation of scales, where we deal only with instruments 
that have coarser angular resolution that the correlation 
length of modeling errors specified by R (described be- 
low). We will remain in this regime for the rest of the 



^ We emphasize that the spatial correlations encoded by R are 
spatial correlations in the foreground error, not correlations in 
the spatial structure of the foreground emission itself. The spa- 
tial structure of the foregrounds are captured by the m^^ terms 
of Equations [9] and and will typically be correlated over much 
larger scales than the errors. The data analysis formalism that 
we present in Section |III| will take into account both types of 
correlation. 

^ Note that a does not necessarily have to be equal to ^fg. For 
instance, if one's foreground model is based on a survey with 
some instrumental beam size that is oversampled in an attempt 
to capture all the features in the map, one would be in a situation 
where ^fg < cr. 



FIG. 3. Foreground template at 79 MHz. The color scale is linear and has been capped at 6000 K to bring out visual contrast 
even though much of the sky is far brighter. 



where ao is a fiducial spectral index for typical fore- 
grounds, which in this paper we take to be 2.5. The 
parameter ctq, controls the spectral coherence of the fore- 
grounds, and we choose the rather large value of = 1 
to be conservative. 

With this, the mean spectral fit to our foreground sur- 
vey is 

{t{u)) =aJ (^^^ p{a)da. (16) 

Sampling this function at a discrete set of frequencies 
corresponding to the frequency channels of our global 
signal experiment, we can form a mean vector (t). A 
covariance matrix C®^^^®^ of the power law fits is then 
given by 

^survey ^ ^^^^^ _ ^^^^^^^^ (^7) 

where t is a discretized version of t{u)^ and 

{tt')p^=A^ j (^^) \{a)da. (18) 

Finally, we take the covariance C^^^^®^ and insert it into 
the left hand side of Equation Q (just as we did with 
Ggsm earlier) to form our spectral correlation matrix Q. 
Note that the normalization constant A cancels out in 
the process, as we claimed. 

In Figure |4j we show the eigenvalues of Q, and in 
Figure [Sj the first few eigenmodes. Just as with the 
foregrounds themselves, the roughly exponential decay 
of the eigenvalues show that the foreground correlations 
are dominated by the first few eigenmodes, which are 
smooth functions of frequency. 
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FIG. 4. First few eigenvalues of the spectral correlation ma- 
trix Q. The eigenvalues decay roughly exponentially, which 
means that the foreground correlations are dominated by the 
first few eigenmodes. 



2. Instrumental Noise 



We model the instrumental noise nj^^* in every pixel 
and every frequency channel as uncorrelated. Addition- 
ally, we make the assumption that there are no system- 
atic instrumental effects, so that (n-^^*) = 0. What re- 
mains is the random contribution to the noise. Assuming 
a sky- noise dominated instrument, the amplitude of this 
contribution is given by the radiometer equation. In our 
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FIG. 5. First few eigenvectors of the spectral correlation 
matrix Q. These spectral eigenmodes are seen to be quite 
smooth, and have increasing structure as one goes to higher 
modes. In conjunction with Figure [4] this shows that the 
frequency correlations in the foreground are dominated by 
smooth spectral eigenmodes. 



notation, this gives rise to a covariance of the fornj^ 



T^inst 
■'■^ ai(3j 



AtAu 



where At is the integration time per pixel and Au is the 
channel width. 

The instrumental noise is different from all other sig- 
nal sources in an experiment (including the cosmological 
signal and all other forms of generalized noise) in that 
it is the only contribution to the measurement that is 
added after the sky has been convolved by the beam 
of an instrument. All the "other" contributions should 
be convolved with the beam. For instance, if an instru- 
ment has a Gaussian beam with a standard deviation ^5, 
then these contributions to the covariance are multiplied 
by ex.p[—Ol£{£-\- 1)] in spherical harmonic space [46( 147) . 
However, since the instrumental noise is the only part of 
our measurement that does not receive this correction, 
it is often convenient to adopt the convention that all 
measured maps of the sky are already deconvolved prior 
to the main data analysis steps. This allows us to retain 
all of the expressions for the cosmological signal and the 
foregrounds that we derived above, and to instead mod- 
ify the instrumental noise to reflect the deconvolution. 
In this paper, we will adopt the assumption of already- 
deconvolved maps. 

If the instrumental noise contribution were rotation- 
invariant on the sky (as is the case for many Cosmic Mi- 



^ In principle, the cosmological signal also contributes to the sky- 
noise, and so there is in fact some cosmological information in the 
noise. For the noise term only, we make the approximation that 
the cosmological signal contributes negligibly to the sky temper- 
ature, since the foregrounds are so much brighter. For any rea- 
sonable noise level, this should be an excellent assumption, and 
in any case a conservative one, as extra sources of cosmological 
information will only serve to tighten observational constraints. 



crowave Background experiments), modifying the noise 
to reflect deconvolution would be simple. One would 
simply multiply the instrumental noise covariance by 
exp[^^^(^+ 1)] in spherical harmonic space [46, 47 . Un- 
fortunately, since we are assuming a sky-noise domi- 
nated instrument, the assumption of rot at ion- invar iance 
breaks down, thanks to structures such as the Galac- 
tic plane, which clearly contributes to the sky signal in 
an angularly-dependent way. The simple prescription of 
multiplying by exp[6>^£(-^ + 1)] thus becomes inadequate, 
and a better method is required. 

Suppose we work in a continuous limit and let 6(r,r') 
be an integration kernel that represents a Gaussian in- 
strumental beam, so that 

T^°""(f) = J b{r,r')T{r')dn' 

= ^r,^(r)e-^^^^(^+^)/^ / r/^(rOT(fO^^^(20) 

£,m 

where T^'=>^^(j^ is convolved version of the original sky 
T(r) and Yim are the spherical harmonics. With this, our 
effective {i.e. deconvolved) instrumental noise covariance 
is given by 

m^^{r2)b-\r2,r')dnidn2. (21) 

where m^^ is the continuous version of m^^, and the 
deconvolution kernel (given by the operator inverse of 
6), which in spherical harmonic space simply multiplies 
by e^^^^^+^^/^. For notational simplicity, we have omit- 
ted constants and suppressed the frequency dependence 
(since it has nothing to do with the spatial deconvolu- 
tion). Now, suppose we make the approximation that 
the foregrounds are spatially smooth and slowly varying. 
In real space, 6~^(f,f') is a rapidly oscillating function 
that is peaked around f = r ^ We may thus move the 
two copies of m^^ outside the integral, setting r = r 1 and 
f = f2: 

= mfs(f)mf^(rO ^e^^'^(^+i)y/^(r)r,^(rO. (22) 

If the m^s(r)m^^(f') terms were not present, this would 
be equivalent to the "usual" prescription, where the effec- 
tive noise kernel involves transforming to spherical har- 
monic space, multiplying by e^f'^^^+^^Z^, and transforming 
back. Here, the prescription is similar, except the kernel 
is modulated in amplitude by the sky signal. 

In conclusion, we can take instrumental beams into 
account simply by replacing N^"®* with an effective in- 
strumental noise covariance of the form 



inst,eff 



(23) 
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where 'Daifdj = ^ai^a(3^ij^ and N^®^ is a deconvolved 
white noise covariance {i.e. one that is multiphed by 
g6't,£(£+i) spherical harmonic space) with proportional- 
ity constant l/AtAu. 

In deriving our deconvolved noise covariance, we made 
the assumption that the sky emission is spatially smooth 
and slowly- varying. This allowed us to treat the decon- 
volution analytically even in the case of non- white noise, 
and in Section [lIIB| will allow us to analytically derive an 
optimal estimator for the cosmological signal. While our 
assumption of smooth emission is of course only an ap- 
proximation, we expect it to be a rather good one for our 
purposes. The only component of the sky emission where 
smoothness may be a bad assumption is the collection of 
bright point sources. However, we will see in Section 



IIIB that the optimal estimator will heavily downweight 
brightest regions of the sky, so extremely bright point 
sources are effectively excluded from the analysis any- 
way. 



3. Cosmological Anisotropy Noise 

In measuring the global signal, we are measuring the 
monopole contribution to the sky. As mentioned above, 
any anisotropic contribution to the cosmological power 
is therefore a noise contribution as far as a global signal 
experiment is concerned. By construction, these non- 
monopole contributions have a zero mean after spatially 
averaging over the sky, and thus do not result in a sys- 
tematic bias to a measurement of the global signal. They 
do, however, have a non-zero variance, and therefore con- 
tribute to the error bars. 

Although it is strictly speaking non-zero, we can safely 
ignore cosmological anisotropy noise because it is negligi- 
bly small compared to the foreground noise. Through a 
combination of analytic theory [48 and simulation anal- 
ysis [38], the cosmological anisotropics have been shown 
to be negligible on scales larger than ~ 1° to 2°, which 
is a regime that we remain in for this paper. 



4. Generalized noise model summary 

In the subsections above, we have outlined the vari- 
ous contributions to the generalized noise that plagues 
any measurement of the global signal. Of these contri- 
butions, only foregrounds have a non-zero mean, so the 
mean of our generalized noise model is just that of the 
foregrounds: 



(n) 



m ' 



(24) 



Foregrounds therefore have a special status amongst the 
different components of our generalized model, for they 
are the only contribution with the potential to cause a 
systematic bias in our global signal measurement. The 



other contributions appear only in the total noise covari- 
ance, taken to be the sum of the foreground covariance 
and the effective instrumental noise covariance: 



^ai(3j = N; 



ai(3j 



-N 



inst,efr 
ai(3j ' 



(25) 



where as noted above, we are neglecting the cosmological 
anisotropy noise. 

In the foreground subtraction/data analysis scheme 
that we describe in Section [TITj we will think of the mean 
m as a foreground template that is used to perform a first 
subtraction. However, there will inevitably be errors in 
our templates, and thus our scheme also takes into ac- 
count the covariance N of our model. In our formalism, 
the mean term therefore represents our best guess as to 
what the foreground contamination is, and the covariance 
quantifies the uncertainty in our guess. We note that this 
is quite different from many previous approaches in the 
literature, where either the foreground modeling error is 
ignored {e.g. when the foreground spectra are assumed 
to be perfect polynomials), or the mean is taken to be 
zero and the covariance is formed by taking the ensem- 
ble average of the outer product of the foreground tem- 
plate error. The former approach is clearly unrealistic, 
while the latter approach has a number of shortcomings. 
For example, it is difficult to compute the necessary en- 
semble average, since foregrounds are difficult to model 
from first principles, and empirically the only sample that 
we have for taking this average is our Galaxy. As a so- 
lution to this, ensemble averages are often replaced by 
spatial {i.e. angular) averages. But this is unsatisfactory 
for our purposes, since in Section IIIB| we will be us- 
ing the angular structure of foregrounds to aid with fore- 
ground subtraction, and this is impossible if the informa- 
tion has already been averaged out. Even if an ensemble 
average could somehow be taken (perhaps by running a 
large suite of radiative transfer foreground simulations), 
a foreground subtraction scheme that involved minimiz- 
ing the resulting variance would be non-optimal for two 
reasons. First, in such a scheme one would be guarding 
against foreground power from a "typical" galaxy, which 
is irrelevant — all that matters to an experiment are the 
foregrounds that are seen in our Galaxy, even if they are 
atypical. In addition, foregrounds are not Gaussian dis- 
tributed, and thus a minimization of the variance is not 
necessarily optimal. 

Our approach — taking the mean to be an empirical 
foreground template and the covariance to be the errors 
in this template — solves these problems. Since the covari- 
ance arises from measurement errors (which can usually 
be modeled to an adequate accuracy), taking the ensem- 
ble average is no longer a problem. And with the mean 
term being a template for the foregrounds as seen by 
our experiment, our foreground model is tailored to our 
Galaxy, even if our Galaxy happens to be atypical. Fi- 
nally, while the foregrounds themselves are certainly not 
Gaussian, it is a much better approximation to say that 
the uncertainties in our model are Gaussian, at least if 
the uncertainties are relatively small. Constructing our 
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FIG. 6. A comparison of the absolute value of the expected 
theoretical signal spectrum (blue circles) and the mean fore- 
ground contamination (purple squares). This highlights the 
difficulty of global 21 cm signal experiments: the foregrounds 
are many orders of magnitude brighter than the signal we seek 
to measure. 



foreground model in this way thus allows us to take ad- 
vantage of the optimal data analysis techniques that we 
introduce in Section ITTIl 



C. Why it's hard 

Before we proceed to describe how the global 21 cm 
signal can be optimally extracted, we pause to describe 
the challenges aheaqj As an initial "straightforward" 
approach, one can imagine measuring the global 21 cm 
signal by taking a simple spatial average of a measured 
sky. The corresponding foreground contamination would 
be obtained by spatially averaging our model, which is 
shown in Figurej6| along with our expected theoretical sig- 
nal from Figure]^ A straightforward measurement would 
thus be completely dominated by the bright foregrounds. 
In addition, both the foreground contamination and the 
theoretical signal are smooth as a function of frequency, 
making it difficult to use foreground subtraction tech- 
niques that have been proposed for tomographic maps, 
where the cosmological signal is assumed to vary much 
more rapidly as a function of frequency than the fore- 
grounds. It is therefore crucial that optimal foreground 
cleaning methods are employed in the analysis, and in 
the following section we derive such methods. 



Not included in this paper is the fact that an instrument might 
have a non-trivial frequency response that needs to be calibrated 
extremely well. In principle, if one has 1) sufficiently good in- 
strumental calibration and 2) an exquisitely accurate foreground 
model, then it will always be able to pick out a small cosmolog- 
ical signal from beneath the foreground sources, however bright 
they might be. In this paper we concentrate on lessening the 
second requirement by proposing reliable foreground subtraction 
algorithms. Tackling the problem of calibration is beyond the 
scope of this paper, but encouraging progress has recently been 
made in engineering tests |37| . 



III. A FRAMEWORK FOR GLOBAL SIGNAL 
MEASUREMENTS 

In this section, we develop the mathematical frame- 
work for analyzing data from global signal experiments. 
We begin with a measurement equation. For an experi- 
ment with both spectral and angular sensitivity, we have 



y 



(26) 



where y is a vector of length A/vox = ^pix^freq containing 
the measurement, n is the generalized noise contribution 



of Section II B , Xg is a vector of length TVfreq containing 
the global signal that we wish to measure, and A is a ver- 
tical stack of A/pix identity matrices of size A/'f^eq x N^req_- 
The effect of multiplying the global signal by A is to 
copy the theoretical spectrum to every pixel on the sky 
before the noise and foregrounds are added. The term 
Axg therefore represents a data balQthat contains ideal 
(noiseless and foregroundless) data that depends only on 
the radial distance from the center, and not on the di- 
rection. To every voxel of this data volume the com- 
bined noise and foreground contribution n is added, giv- 
ing the set of measured voxel temperatures that comprise 
y. Note that by thinking of the measurement vector as 
a set of voxel temperatures, we have implicitly assumed 
that a prior mapmaking step has been performed on the 
raw time-ordered data. This step should ideally take into 
account instrumental complications such as instrumental 
beam sidelobes. 

For measurements with no angular sensitivity, we can 



define z = -^^^A^y to be the globally averaged mea- 
surement. In this notation, our measurement equation 
becomes 



1 



z X. 



-A^n = X5 + c. 



(27) 



pix 



where c = -^^^A*n is the angularly averaged noise and 



foreground contribution, with mean 



1 



-A*(n) 



Nr. 



-A*m, 



(28) 



where in the last step we used the definition in Equation 
(24), and covariance 



(cc*) - {c)(c*) 



1 



A^2 

pix 



A*NA. 



(29) 



Our goal is to derive a statistically optimal estimator 
Xg for the true global signal Xg. With an eye towards 
optimizing experimental design, we will construct opti- 
mal estimators for both measurement equations, treating 



^ Or perhaps a "data shell", since there is a lower limit to the 
redshift of the experiment. 



11 



the spectral-only measurements in Section III A| and the 
spectral-plus-angular measurements in Section IIIB| We 
will prove that beyond simply subtracting a best-guess 
model for the foregrounds, there is formally nothing that 
can be done to mitigate foreground residuals by using 
spectral information only. In contrast, adding angular 
foreground information partially immunizes one from er- 
rors in the best-guess model, and allows the final error 
bars to be reduced. 



A. Methods using only spectral information 

For methods that use only spectral information, we 
write down an arbitrary linear estimator of the form 



X, = Mz - d, 



(30) 



where M is an A/'freq x A/'freq matrix and d is a vector of 
length A^freq, whose forms we will derive by minimizing 
the variance of the estimator. 

Taking the ensemble average of our estimator and in- 
serting Equation (pTl gives 



(31) 



(x,) = Mx, + M(c) - d, 



which shows that in order for our estimator to avoid hav- 
ing a systematic additive bias, one should select 



d = M{c). 



(32) 



With this choice, we have (xg) = Mx^. The variance of 
this estimator can be similarly computed, yielding 

S = (x,x* ) - (x, ) (x* ) = MCM* . (33) 

We can minimize this variance subject to the normaliza- 
tion constraint M-^a = 1 by using Lagrange multipliers. 
To do so we minimize 



(MCM^),, - A«M 



(34) 



with respect to the elements of M. Taking the necessary 
derivatives and solving for the Lagrange multiplier A that 
satisfies the normalization constraint, one obtains 



(35) 



Inserting this into our general form for the estimator, we 
find 



(36) 



(C-i) 



^[C-(z-(c))]^. 



In words, this prescription states that one should take 
the data, subtract off the known foreground contami- 
nation, and then inverse variance weight the result be- 
fore re-weighting to form the final estimator. The in- 
verse variance weighting performs a statistical suppres- 
sion/subtraction of instrumental noise and foreground 



contamination. Loosely speaking, this step corresponds 
to the subtraction of polynomial modes in the spectra 
as simulated in [211 SI] and implemented in [22]. The 
final re-weighting by rescales the modes so that the 
previous subtraction step does not cause the estimated 
modes to be biased high or low. For instance, if a certain 
mode is highly contaminated by foregrounds, it will be 
strongly down-weighted by the inverse variance weight- 
ing, and thus give an artificially low estimate of the mode 
unless it is scaled back up. 

The corresponding measurement error covariance for 
this estimator is given by 



S„^^{X^X^)-{x?)(xf) 



(C-l)a 



(C-i)«„(C-i)^/ 



(37) 



It is instructive to compare this with the errors predicted 
by the Fisher matrix formalism. By the Cramer-Rao in- 
equality, an estimator that is unwindowed {i.e. one that 
has (xg) = Xg) will have a covariance that is at least as 
large as the inverse of the Fisher matrix. Computing the 
Fisher matrix thus allows one to estimate the best possi- 
ble errors bars that can be obtained from a measurement. 
In the approximation that fiuctuations about the mean 
are Gaussian, the Fisher matrix takes the form 

F^p = ixr [C-iC,„C-iC,^] + {z)t„C-i{z),^, (38) 

where commas denote derivatives with respect to the pa- 
rameters that one wishes to measure. In our case, the 
goal is to measure the global 21 cm spectrum, so the pa- 
rameters are the values of the spectrum at various fre- 
quencies. Put another way, we can write our mean mea- 
surement equation as 



(c), 



(39) 



where is a unit vector with O's everywhere except for 
a 1 at the a^^ frequency channel. The derivative {z)^^ 
with respect to the a^^ parameter (i.e. the derivative 
with respect to the mean measured spectrum x^ in the 
^th freqijency channel) is therefore simply equal to Gq,. 
Since the measurement covariance C does not depend on 
the cosmological signal x^, our Fisher matrix reduces to 



Faf3 = e^C 



(40) 



which shows that the Fisher matrix is simply the in- 
verse covariance i.e. F = C~^. This implies that the 
covariance of the optimal unwindowed method is equal 
to the original noise and foreground covariance, which 
means that the error bars on the estimated spectrum are 
no smaller than if no line-of-sight foreground subtraction 
were attempted beyond the initial removal of foreground 
bias [Equations (31) and (32)]. In our notation, an un- 
windowed estimator would be one with M = I [from 
Equation ([3l]) plus the requirement that the estimator 
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be unbiased, i.e. Equation (32)]. But this means our 
optimal unwindowed estimator is 



giving up on the unwindowed requirement, the estimator 
can evade the Cramer-Rao bound). However, this reduc- 



(c), 



(41) 



which says to subtract our best-guess foreground spec- 
trum model and to do nothing more! 

To perform just a direct subtraction on a spectrum 
that has already been spatially averaged and to do noth- 
ing else [as Equation ( [41] ) suggests] is undesirable, be- 
cause the error covariance in our foreground model is 
simply propagated directly through to our final measure- 
ment covariance. This would be fine if we had perfect 
knowledge of our foregrounds. Unfortunately, uncertain- 
ties in our foreground models mean that residual fore- 
grounds can result in rather large error bars. As an exam- 
ple, suppose we plugged our foreground covariance from 
Section |IIB 1| into the spatially averaged covariance C 
[Equation (|29[)]. If we optimistically assume completely 
uncorrelated foreground model errors (so that they aver- 
age down), one obtains an error bar of 11 K at 79 MHz. 
This is far larger than the amplitude of the cosmological 
signal that one expects. 

If one uses the minimum variance estimator [Equation 



(36)] instead, one can reduce the error bars slightly (by 



tion is purely cosmetic, for Equations 36 and 41 differ 
only by the multiplication of an invertible matrix. There 
is thus no difference in information content between the 
two estimators, and the minimum variance estimator will 
not do any better when one goes beyond the measured 
spectrum to constrain theoretical parameters. 

Intuitively, one can do no better than a "do noth- 
ing" algorithm because the global signal that we seek to 
measure is itself a spectrum (with unique cosmological 
information in every frequency channel), so a spectral- 
only measurement provides no redundancy. With no re- 
dundancy, one can only subtract a best-guess foreground 
model, and it is impossible to minimize statistical errors 
in the model itself. 

The key, then, is to have multiple, redundant mea- 
surements that all contain the same cosmological signal. 
This can be achieved by designing experiments with an- 
gular information {i.e. ones that do not integrate over 
the entire sky automatically). Because the global signal 
is formally a spatial monopole, measurements in differ- 
ent pixels on the sky have identical cosmological contri- 
butions, but different foreground contributions, allowing 
us to further distinguish between foregrounds and cos- 
mological signal. 



B. Methods using both spectral and angular information 



For an experiment with both spectral and angular information, the relevant measurement equation is Equation 



(26), and the minimum variance, unwindowed estimator for the signal can be shown [49^ to take the form 



(42) 



(43) 



%s = [A*N-iA] A*N-^(y - m). 
With this estimator, the error covariance can also be shown to be 

S^{x,x*)-{x,)(x*)= [A*N-iA]-\ 

These equations in principle encode all that we need to know for data analysis, and even allow for a generalization 
of the results that follow in the rest of this paper. For example, frequency-dependent beams (a complication that 
we ignore, as noted earlier) can be incorporated into the analysis by making suitable adjustments to N. Recall from 
Section |IIB 2| that our convention is to assume that all data sets have already been suitably deconvolved prior to 
our analysis. Taking into account a frequency-dependent beam is therefore just a matter of including the effects of 
a frequency-dependent deconvolution in the generalized noise covariance. For the Gaussian beams considered in this 
paper, for instance, we can simply make the 6^5 parameter [in Equation (22)] a function of frequency. 

Despite their considerable fiexibility, we will not be using Equations 42 and 43 in their most general form. For the 
rest of the paper, we will make the approximation of frequency-independent beams, which allows us to make some 
analytical simplifications. Doing so not only allows us to build intuition for what the matrix algebra is doing, but 
also makes it computationally feasible to explore a wide region of parameter space, as we do in later sections. In its 
current form. Equation (42) is too computationally expensive to be evaluated over and over again because it involves 
the inversion of N, which is an A/vox x ^vox niatrix. Given that A^vox is likely to be quite large for a 21cm survey 
with full spectral and angular information, it would be wise to avoid direct matrix inversions. 

We thus seek to derive analytic results for = [A^N~^A]~^ and A^N~^, where N is given by Equation (25). We 
begin by factoring out the foreground templates from our generalized noise covariance, so that N = DND, wheraj 



Recall that throughout this paper, we use Greek indices to sig- ^^^^^^^ g^g^^fy ^^g^l^^ dimensions, 

nify the spectral dimension (or spectral eigenmodes) and Latin 
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D 



^ai^a/3^iji just as we defined in Section II B 2 This is essentially a whitening procedural making the 



generalized noise independent of frequency or sky pixel. Since both 5] = [A^N ^A] ^ and A^N ^involve N 
we proceed by finding = D~^N~^D~^, and to do so we move into a diagonal basis. The matrix D is already 
diagonal and easily invertible, so our first step is to perform a similar ity t ransformation in the frequency- frequency 



components of N in order to diagonalize Q (which, recall from Section 
foregrounds). In other words, we can write N as 



II B 



quantifies the spectral correlations of the 



N. 



ai(3j 



(VNv^; 



aii3j 



^ ^ ^ a.ir]k 
rjXkm 



dec 



^r]X^ I3j Xm i 



(44) 



th 



where Y^ir^k = {'^r])a^ik^ with (v^)ck signifying the value of the a component {i.e. frequency channel) of the 77 
eigenvector of Q. The 77*^ eigenvalue is given by A^. Note also that = V^. 

Our next step is to diagonalize R, the spatial correlations of the foregrounds, as well as N^®^, which as the 
whitened instrumental noise covariance is spatially correlated after the data has been deconvolved. If we assume 
that the correlations are rotationally invarianlp^ these matrices will be diagonal in a spherical harmonic basis. For 
computational convenience, we will now work in the continuous limit. As discussed above and shown in ^^^46^, this 
is entirely consistent with the discrete approach provided one augments all noise covariances with a factor of 47r/A^pix. 
In the continuous limit, the spatial correlations of the foregrounds therefore take the form 



47r exp((j ^r-f') 
iVp~47r<T2 sinh(a-2)' 



(45) 



which (except for our 47r/A^pix factor) is known as a Fisher function, the analog of a Gaussian on a sphere. For 
cr <C 1 rad, this reduces to the familiar Gaussian: 



i?(f,f') 



47r 1 



exp 



10^ 



(46) 



where 6 = arccos(f •?') is the angle between the two locations on the sphere. Switching to a spherical harmonic basis, 
we have 



Rf 



47r 



d^d^'Yl^{v)R{v,v')Y,>m'{v') 



47r 
AW 



exp 



1 



1) 



(47) 



where Y^rn denotes the spherical harmonic with azimuthal quantum number ^ and magnetic quantum number m, and 
the last approximation holds if <j <C 1 rad. 

For the instrumental noise term, we saw in Section II B 2 that after dividing out the instrument's beam, we have 

47r e^'^(^+^ 



dec 



and adding this to the foreground contribution gives us the equivalent of N but in spherical harmonic space: 



N 



imrjl'm' X 



47r 



Ob 



AtAiy 



(48) 



(49) 



The diagonal nature of N in this equation allows a straightforward inversion: 



(N )imr]l' m' X 



N ■ 

^pix 

47r 



AtAu 



mm' ^'qXi 



(50) 



^ In what follows, N, N, N, and N all refer to the same matrix, but 
use different unit conventions and/or are expressed in different 
bases. The original generalized noise N is assumed to be in "real 
space" i.e. frequency and spatial angles on the sky; N is in the 
same basis, but is in units where the foreground model has been 
divided out; N is the same as N, but in a spatiaj^ angle and 
spectral eigenforeground basis; N is the same as N, but in a 
spherical harmonic and spectral eigenforeground basis. 

^ Recall that R encodes the spatial correlations in the errors of 



our foreground model. It is thus entirely possible to break rota- 
tion invariance, for instance by using a foreground model that is 
constructed from a number of different surveys, each possessing 
different error characteristics and different sources of error. For 
this paper we ignore this possibility in order to gain analytical 
intuition, but we note that it can be corrected by finding the 
eigenvectors and eigenvalues of R, just as we did with Q. 
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thus allowing us to write the inverse matrix in the original spatial basis as 

2 

/ V airjim 



(N )ail3j 



47r 



2 / ^fg 



(51) 

where ^fsimai = ^ajs^imi and Y is a unitary matrix that transforms from a real-space angular basis to a spherical 

harmonic basis. Obtaining the inverse of N from here is done by evaluating = VN ^V^. 

We are now ready to assemble the pieces to form = [A^N~^A]~^ which, in the notation of our various changes 

of basis can be written as [A^D~-^F'''VN V^FD~-^A]~-^. We first compute 



(V^FD ^)alrapj = lmj{^ a) ^ = lmj{^ a) p^^j 



(52) 



where \X(^i = l/niai is the reciprocal temperature (in units of K~^) at the a*^ frequency and the i^^ pixel of our 
foreground templates. Note that in the above expression, there is no sum over j yet. This is accomplished by the 
angular summation matrix Aaiis = 5^^^ giving 



(V^FD ^A.)o,lm^ = (Va)/3 ^ Y£mjU/3j = {^a)^^^ 



(53) 



where there is similarly no sum over /?, and u signifies our reciprocal foreground templates in spherical harmonic 
space. The inverse covariance is thus given by 



im 



c-2 ( ^jA \ £{£+!) I 



(54) 



Defining tint = N^i^At to be the total integration time over the survey , and making the substitution TVpix = 47r/6>; 



gives 



^)«/3 = ^X!K)a(v^)/3^■ 



UQ;£mU/3£m 



tint Ai/ 



(55) 



At this point, notice that the angular cross-power spectrum C^^ between two reciprocal maps u^^ and u^^i at 
frequencies a and /3 respectively is given by 



1 ^ 



(56) 



where the "ii" superscript serves to remind us that (7^'"^ is the cross-power spectrum of the reciprocal maps, not the 
original foreground templates. With this, our expression for the inverse measurement covariance can be written 



= ^X^K)a(v.^)^^ 



2^ + 1 



47r 
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u,a(3 



(57) 



Permuting the sums and recalling that and are the r]^^ eigenvalue and eigenvector of our foreground spectral 
correlation matrix Q respectively, this expression can be further simplified to give 



£=0 



47r tmt^V 



(58) 



That is, refers to the amount of integration time spent by 
a single beam on a small patch of the sky with area equal to 
our pixel size, and tint refers to the total integration time of a 



single beam scanning across the entire sky. An experiment capa- 
ble of forming A/'beams independent beams simultaneously would 
require one to replace tint with Abeams^int in onr formalism. 
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This provides a fast prescription for computing 5] = [A*N~^A]~^. One first inverts a series of relatively small 
matrices (each given by the expression in the square brackets) . These inversions do not constitute a computationally 
burdensome load, for a and /3 are frequency indices, so the matrices are of dimension A/f^eq x A/'freq. One then uses 
publicly available fast routines for computing the angular cross-power spectrum, multiplies by {2i -\- l)/47r and the 
inverted matrices, and sums over £. The resulting matrix is then inverted, a task that is much more computationally 
feasible than a brute-force evaluation of X), which would involve the inversion of an A^vox x A^vox matrix. 

Using essentially identical tricks, we can also derive a simplified version of our estimator Xg [Equation (42)]. The 
result is 



(59) 



where y and m are continuous versions of the measured sky signal y and the foreground model m respectively, and 
the weights w are defined as 

I 



47r 



(60) 



In words, this estimator calls for the following data 
analysis recipe: 

1. Take the measured sky signal yij^f) and subtract 
the best-guess foreground model m(r, v). 



2. 



Downweight regions of the sky that are believed to 
be heavily contaminated by foregrounds by dividing 
by our best-guess model m{r^u). [This and the 
previous step of course simplify to give y/m — 1^ as 
we wrote in Equation (59)]. 



3. Express the result in a spherical harmonic and spec- 
tral eigenmode basis. 

4. Take each mode (in spherical harmonic and fre- 
quency eigenmode space) and multiply by weights 

5. Transform back to real (angle plus frequency) 
space. 

6. Divide by m(r, u) to downweight heavily contami- 
nated foreground regions once more. 

7. Sum over the entire sky to reduce the data to a 
single spectrum. 

8. Normalize the spectrum by applying 5] = 
[A^N~-^A]~-^ [the inverse of Equation (58)] to en- 
sure that modes that were heavily suppressed prior 
to our averaging over the sky are rescaled back to 
their correct amplitudes. (Note that the error bars 
will also be correspondingly rescaled so that heavily 
contaminated modes are correctly labeled as lower 
signal-to-noise measurements). 

The recipe outlined here takes full advantage of spec- 
tral and angular information to mitigate foregrounds and 
produce the smallest possible error bars under the con- 
straint that there be no multiplicative or additive bias 
i.e. under the constraint that (x^) = x^. To see how 



this recipe works intuitively, we can plot the weights w^^ 
to see how our optimal prescription uses various spher- 
ical harmonic i and eigenforeground rj modes. This is 
shown in Figure [Tj At low spherical harmonic ^, the first 
few {i.e. smoothest) eigenmodes are dominated by fore- 
grounds, so they are severely downweighted. At high 
the limited angular resolution of our instrument means 
that the instrumental noise dominates regardless of spec- 
tral eigenmode, so all eigenmodes are weighted equally 
in an attempt to average down the instrumental noise. 
At high T] (spectrally unsmooth eigenmodes), the fore- 
ground contamination is negligible, so the weightings are 
dictated entirely by the instrumental noise, with the only 
trend being a downweight ing of the noisier high £ modes. 



It is important to emphasize that these weights are ap- 
plied to whitened versions of the data, i.e. data where 
the best-guess foreground model was divided out in Step 
2 above. If this weren't the case, the notion of weights 
as a function of £ would be meaningless, for our goal 
is to estimate the monopole (in other words, the global 
signal), so all that would be required would be to pick 
the £ = component. The monopole in our original 
data is a linear combination of different £ modes in the 
whitened units, and the weights tell us what the optimal 
linear combination is, taking into account the interplay 
between foreground and instrumental noise contamina- 
tion. In contrast, the spectral-only methods are a "one 
size fits all" approach with no whitening and a simple 
weighting that consists solely of the £ = row of Figure 
[7| ignoring the fact that at some (^, 77) the foregrounds 
dominate, whereas at others the instrumental noise dom- 
inates. 



The rest of this paper will be dedicated to examining 
the error properties of our optimal estimator. The goal is 
to gain an intuitive understanding for the various trade- 
offs that go into designing an experiment to measure the 
global 21 cm spectrum. 
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FIG. 7. Weights [Equation ( |60[)] i n spherical harmonic ^ and eigenmode 77 space for combining data in our optimal estimator 
of the global signal [Equation (|59[)]. Low 77 and high ^ modes are downweighted because of contamination by foregrounds and 
instrumental noise, respectively. 



C. Making the cosmological signal more apparent 



In Sections |IVB| and |V| we will find that despite our 
best efforts at foreground mitigation, the resulting er- 
ror bars on our measured signal can sometimes still be 
large, particularly for instruments that lack angular res- 
olution. These errors will be dominated by residual fore- 
grounds at every frequency, and are unavoidable without 
an exquisitely accurate foreground model. The resulting 
measurements will thus look very much like foreground 
spectra. 

Often, we will find that this happens even when the de- 
tection significance is high. This apparent paradox can 
be resolved by realizing that often the large errors are 
due to a small handful of foreground modes that domi- 
nate at every single frequency channel. Put another way, 
the contaminants are fundamentally sparse in the right 
basis, since there are a small number of modes that are 
responsible for most of the contamination. Plotting our 
results as a function of frequency is therefore simply a 
bad choice of basis. By moving to a better basis (such 
as one where our measurement covariance X) is diagonal, 
as we will discuss in Section IV B), it becomes appar- 



significance. 

Still, it is somewhat unfortunate that the frequency 
basis is a bad one, as the cosmological signal is most eas- 
ily interpreted as a function of frequency. It would thus 
be useful to be able to plot, as a function of frequency, a 
measured spectrum that is not dominated by the largest 
amplitude foreground modes, even if we are unable to re- 
move them. Essentially, one can arrive at a measurement 
that is closer to the "true" cosmological signal by simply 
giving up on certain modes. This means resigning oneself 
to the fact that some modes will be forever lost to fore- 
grounds, and that one will never be able to measure those 
components of the cosmological signal. As discussed for 
an analogous problem in [44 , this can be accomplished 
by subjecting our recovered signal to a Wiener filter W: 



^Wiener 



S[S + S]-ixs = WXs 



(61) 



ent that the cosmological signal can be detected to high 



where S is the signal covariance matrix and Xg is 
our estimator from Equation (59). Roughly speak- 
ing, this amounts to weighting the data by "signal 
over signal plus noise" , which means well- measured (low 
noise/foreground) modes are given a weight of unity, 
whereas poorly measured modes are downweighted and 
effectively excluded from our estimate. 
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The Wiener-filtered result has the desirable property 
of minimizing the quantity where s = Xg — Xg 

is the error vector. It thus represents our "best guess" 
as to what the cosmological signal looks like, at the ex- 
pense of losing the information in highly contaminated 
modes. Since these modes are irretrievably lost (without 
better foreground modeling), one must also Wiener- filter 
the theoretical models in order to make a fair compari- 
son. We show such Wiener-filtered theoretical spectra in 
Section [IVI 



In contrast to the Wiener filter, our previous esti- 
mator minimized (Is^p) only under the constraint that 
(xg) = Xg. It also had the property that it minimized 
= (y — Axs)^N~-'^(y — Axg) [49 , and so the estimator 
constructed an unbiased model that best matched the 
observations. The model will thus include modes that 
are so contaminated that there is no hope of measuring 
the cosmological signal in them, since these modes, how- 
ever contaminated and error-prone they might be, are in 
fact part of the observation. The result is a foreground 
contaminated spectrum, which the Wiener-filtered result 
avoids. 

It must be emphasized, however, that because the 
Wiener filter W is an invertible matrix, there is no change 
in information content. The information about the cos- 
mological signal was always there, and moving to a dif- 
ferent basis or Wiener filtering simply made it more ap- 
parent. Wiener filtering is simply a convenient post- 



processing visualization tool for building intuition, and 
will not change our ability to constrain the physics in 
our theoretical models. 



IV. A DESIGNER'S GUIDE TO EXPERIMENTS 
THAT PROBE THE DARK AGES 

Having established a general framework for analyzing 
data from global signal experiments, we now step back 
and tackle the problem experimental design. In this 
section, we will consider experiments that are designed 
to target the trough in brightness temperature between 
30 MHz and 100 MHz {i.e. probing the dark ages). In 
Section |V| we will discuss experiments that target the 
reionization window from 100 to 250 MHz. 

Our guide to experimental design will be the quantity 
given by Equation (57) [or equivalently. Equation 



(58)]. As an inverse covariance, this quantity represents 
an information matrix. Our goal in what follows will be 
to design an experiment that maximizes the information. 
One approach to maximizing the information would be 
a brute- force exploration of parameter space. However, 
this is a computationally intensive, "black box" method 
that does not yield intuition for the various trade-offs 
that go into experimental design. Instead, we take the 
following approach. We first rewrite our information ma- 
trix by separating out the monopole {£ = 0) term in our 
spherical harmonic expansion: 
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where we have taken advantage of the fact that when I = 
0, the power spectrum takes the value = 47r(i^(f))^ = 
47r(l/T(f))^, with expectation values (• • • ) denoting a 
spatial average in this case. In the next few subsections 
we will interpret each piece of Equation (|62| individu- 
ally, using quantitative arguments to arrive at qualitative 
rules of thumb for global signal experiment design. 

Unless otherwise stated, in what follows we will per- 
form calculations with tint = 100 hrs and assume an in- 
strument with a channel width of Az/ = 1 MHz. As 
stated in Section II B 1[ we will imagine that our fore- 
ground model has roughly the same accuracy as that of 
the global sky model of [19 , and thus set £q = 0.1 at 
a resolution of ^fg = 5°. Since this is also the "native" 
resolution of the GSM, we assume that the errors are cor- 
related over (7 = 5°. Because we do not possess angular 
foreground models that have much finer resolution than 
this (the GSM can at most be pushed to a 1° resolu- 
tion by locking to the Haslam map [50 at 408 GHz) , we 
will not be analyzing experiments with beams that are 
smaller than 5°. The 5° figure that will be referenced re- 



peatedly in the following sections thus does not represent 
some optimized beam size, but should instead be thought 
of as a fiducial value chosen to highlight the advantages of 
having a small beam. We do note, however, that thanks 
to the cosmological anisotropy noise described in Section 
|IIB 3[ an instrument with finer beams than a few degrees 
will likely be suboptimal. 



A. How much does angular information help? 

We begin by providing intuition for the role of angu- 
lar information in a global signal experiment where such 
information is available. In general, angular information 
can potentially help one's measurement by: 

• Allowing regions of the sky that are severely con- 
taminated by foregrounds to be downweighted or 
discarded in our estimate of the global signal. 

• Providing information on the angular structure of 
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the foregrounds, which can be leveraged to perform 
spatial foreground subtraction. 

In what follows we will assess the effectiveness each of 
these strategies. 



1. Downweighting heavily contaminated regions 

We begin by addressing the first point. Consider 
the first term of Equation ( [62| ), the information matrix, 
which contains two factors of Since 1/T is a con- 

vex function for T > (almost always the case for our 
foregrounds^, we know from Jensen's inequality that 



> 



1 



(T)' 



(63) 



where equality holds only in the limiting case of uniform 
foregrounds over the entire sky. Put another way, equal- 
ity holds only when an experiment has no angular sen- 
sitivity, which is mathematically equivalent to an exper- 
iment that measures the same value in all pixels of the 
sky. With angular sensitivity, (1/T) rises above 1/(T), 
increasing the information content via the monopole term 
of Equation (62). More intuitively, we can define the 
quantity: 



Teff ■ 



(64) 



which can be thought of as the effective foreground tem- 
perature of the sky after we have taken advantage of our 
angular sensitivity to downweight heavily contaminated 
regions. In Figure [S] we show this quantity as a function 
of angular resolution at 79 MHz (the behavior at differ- 
ent frequencies is qualitatively similar). This is shown 
along with the spatially averaged foreground tempera- 
ture, which is the relevant temperature scale for exper- 
iments with no angular sensitivity, and the foreground 
temperature in the coolest pixel of the sky, which will 
be useful later in Section IV A 3 As one expects from 



the preceding discussion, the effective foreground tem- 
perature is strictly less than the averaged temperature, 
demonstrating that residual foreground errors can be re- 
duced by downweighting heavily contaminated regions 
of the sky in our analysis, which is only possible if one 
has sufficiently fine angular resolution. The benefits be- 
come increasingly pronounced as one goes to finer and 
finer angular resolution. Intuitively, this occurs because 
the finer the angular resolution, the more "surgically" 
small regions of heavy contamination {e.g. bright point 
sources) can be downweighted. 



^'^ Foregrounds that can be negative, such as those from the 
Sunyaev-Zeldovich Effect, are completely negligible compared to 
the sources considered in this paper. 
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FIG. 8. Effective foreground temperature Teff = (1/T)~^ 
(purple squares), average foreground temperature (T) (gold 
diamonds), and minimum foreground temperature Tmin (blue 
circles) in the sky as a function of the angular resolution at 
79 MHz. In each case, a high resolution foreground map was 
convolved with a Gaussian beam with a fuU-width-half-max 
given by values on the horizontal axis. When interpreted 



with the information matrix [Equation (62)], this shows that 
having angular sensitivity allows an optimal downweighting 
of regions heavily contaminated by foregrounds, reducing the 
effective foreground contamination in the global signal. The 
effect is particularly pronounced with fine angular resolution. 



2. Using spatial structure to perform angular foreground 
subtraction 



Now consider the second potential use of angular in- 
formation, namely, to perform spatial foreground sub- 
traction by taking advantage of the angular structure or 
correlations of foregrounds. The extra information con- 
tent that would be provided by such a procedure is rep- 
resented by the second term of Equation ( 62 ) , where the 



detailed spatial properties of the foregrounds enter in the 
quantity C^. 

To quantify the added benefit of including angular cor- 
relations, we compute the quantity 



7 = (x*S-ix« 



(65) 



which can be thought of as the "number of sigmas" with 
which the cosmological signal can be detected. Since 
Equation (62) gives us X)"^ in a form that is decomposed 



both in angular scale i and foreground mode number r^, 
we can express 7^ in the same decomposition. This is 
shown in Figure [9] for instrumental beams with FWHMs 
of 5° (top panel), 30° (middle panel), and 90° (bottom 
panel). Several trends are immediately apparent. First, 
in all cases the detection significance is concentrated in 
the low i modes, in particular the i = mode (note the 
logarithmic color scale). As expected, the use of higher i 
information to boost detection significance is more preva- 
lent when the instrumental beam is narrow. It is also 
more prevalent for the first few {i.e. spectrally smoother) 
foreground eigenmodes. 
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FIG. 9. Contributions to the detection significance statistic 7^, as a function of tfie foreground spectral eigenmode number 77 
(vertical axis, increasing downwards) and spherical harmonic i (horizontal axis, increasing to the right). Top panel: instrumental 
beam with FWHM of 5°; middle panel: 30°; bottom: 90°. The plots are normalized to give unity when summed over all r\ and 
i. In all three cases, almost all of the detection significance comes from the i — ^ modes beyond the first few r\ modes (note 
the logarithmic color scale). As expected, the dominance of low i modes is more pronounced for experiments with low angular 
resolution. 



To gain intuition for this we can examine the second 
term of Equation (62). At high this term is expo- 
nentially suppressed by the exp \^\^(^ + 1)] factor in the 
denominator, a result of our instrument having finite an- 
gular resolution. At low ^, this behavior is counteracted 
by the other term in the denominator, which has the op- 
posite behavior with i and is proportional to the strength 
of the rf^ foreground mode as quantified by the eigen- 
value A^. A conservative estimate for ^max, the largest 
i mode for which there may be significant information 
content, can be obtained by equating these two terms, 
since at higher t the noise term suppresses the informa- 
tion content. This estimate is a conservative one, for the 
foreground correlations themselves (encapsulated by C^) 
are almost always a decreasing function of ^, pushing the 
peak of useful information content to lower i. Ignoring 



this complication to obtain our rough estimate, we have 



In 



47r 



(66) 



Several conclusions can be drawn from this. First, we see 
that quantities such as £0 appear under the square root 
of the logarithm. Thus, the dependence of tmax on most 
model parameters is extremely weak, and so our quali- 
tative conclusion that most of the detection significance 
comes from low i should be quite robust. The excep- 
tion to this is the dependence on A^, which we saw from 
Figure [4] decays exponentially. We therefore expect a 
non-negligible decrease in imax as one goes to higher and 
higher foreground eigenmodes, a trend that is visually 
evident in Figure |9] Intuitively, the higher foreground 
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eigenmodes are an intrinsically small contribution to the 
contamination, so they are difficult to measure with high 
enough signal-to-noise to be deemed sufficiently "trust- 
worthy" for foreground mitigation purposes. In contrast, 
the first few foreground eigenmodes are easy to measure, 
but this is of course only because they were a large con- 
taminating influence to begin with. From Equation (66), 



40 r 



we can see that extra integration time does allow the 
weak foreground modes to be measured sufficiently well 
for their fine angular features to be used in foreground 
removal, but as we suggested above, the logarithmic de- 
pendence on tint makes this an expensive strategy for 
experiments with coarse (high ^5) beams. 

Based on our discussion so far, one might be tempted 
to conclude that it is unnecessary for 21 cm global signal 
experiments to take advantage of angular resolution at 
all. Another argument for this is presented in Figure p^Oj 
where we once again compute our significance statistic 7, 
but this time vary the number of I modes used in our an- 
gular foreground subtraction. In other words, to produce 
Figure [To] we took Equation (62) and performed the sum 
of I only to some cutoff to produce a truncated informa- 
tion matrix. One sees that taking advantage of angular 
correlations in our foreground model does very little to 
boost the detection significance. As far as this statistic is 
concerned, most of the gain in having angular informa- 
tion comes from the effect discussed in Section IV A 1[ 
namely the downweighting of heavily contaminated re- 
gions to suppress foregrounds before we average. This is 
responsible for the way the finer angular resolution ex- 



periments have higher 7 in Figure 10 even if the higher ^ 
modes are not used for angular foreground mitigation at 
ah. 

The simple downweighting of contaminated regions, 
however, is a strategy that is of limited power. Figure 
[8] tells us that even with 5° beams, we can expect at 
most a factor of 2 mitigation in foreground contamina- 
tion (compared to an experiment with no angular resolu- 
tion at all). Thus, one may argue that angular resolution 
is simply not worth the financial cost or the instrumen- 



tal challenges, since Figure 10 shows that a statistically 
comfortable detection can be made even without angu- 



lar resolution. As we shall see in Section IV the high 



statistical significance seen in Figure 10 is the result of 
spectral methods, which may lead one to abandon angu- 
lar methods entirely. 

Such a conclusion, however, would be a misguided one. 
For while 7 (the "number of sigmas" of our detection) is 
a useful and commonly quoted statistic, it is somewhat 
crude in that it only tells us whether we are seeing a sig- 
nificant departure from our residual noise and foreground 
models. In other words, it tells us that if the emission 
from the sky did in fact take the form of foregrounds 
plus the fiducial global signal assumed in Section II A[ 
we would be able to detect something above pure fore- 
grounds and noise in our measurements. It does not tell 
us what global signal has been detected, or whether our 
final result is indeed correct. 
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FIG. 10. Detection significance 7 for experiments of different 
levels of angular resolution, plotted as a function of various I 
cutoffs, beyond which the angular correlation information is 
not used for foreground mitigation. In all cases, the detection 
significance is high, which we shall see in Section |IVB| is a 
result of spectral signatures. Since the curves in this figure are 
essentially flat, one may initially conclude that there is little 
added benefit to using angular information when dealing with 
foregrounds. However, as Figure [TT] shows, this conclusion is 
incorrect. 
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FIG. 11. Error bars on the measured spectrum at 79 MHz, 
shown as a function of various I cutoffs, beyond which the an- 
gular correlation information is not used for foreground mit- 
igation. Excluding any information of angular correlations 
in the analysis, the errors are highest for the experiments 
with coarse angular resolution, because such experiments are 
less able to downweight heavily contaminated regions of the 
sky, as discussed in Section [IV A 1| The situation reverses at 
moderate cutoffs in ^, and the coarser experiments have an 
advantage because higher I modes of the foregrounds were 
never measured in the first place. However, information in 
these unmeasured modes are lost forever, and cannot be used 
for angular subtraction, unlike in experiments with fine an- 
gular resolution. At high cutoffs in ^, where one is using all 
available information, fine angular resolution once again wins, 
giving the smallest error bars. 
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More precisely, suppose that in addition to Figure [Toj 
we also consider the error bars {T,aa)^^'^ on our measured 
frequency spectrum. This is shown in Figure [TT] where 
we show the error bars at 79 MHz, again as a function of 
the maximum i mode used for angular foreground mit- 
igatiorp^ In this case, we see large changes as we vary 
our i cutoff. In all cases, the errors go down as more 
and more angular correlation information is used. At 
the lowest the finer angular resolution experiments do 
better because they are able to downweight foreground 
contaminated regions in a more spatially precise way (as 
we discussed in Section IVAl). Going to moderate i 



cutoffs, the coarser experiments do better because they 
sampled a blurred version of the sky to begin with, and 
thus do not need to go to as high i in their data analysis 
to make the most of their angular correlation informa- 
tion. However, the blurry measurements do not possess 
the fine details needed perform a high-precision angular 
foreground subtraction, so at higher i cutoffs, the fine 
angular resolution experiments give the lowest errors. 



Figures [To] and [TT] together tell us that even though we 
do not need angular correlation information to tell that 
our measurement contains something more than just pure 
noise and foregrounds, the error bars are large enough 
(being in the 5 to 10 K range at £ = 0) for our results 
to be consistent with a wide variety of theoretical spec- 
tra. Since we expect the cosmological signal to have a 
maximum amplitude of ~ O.IK, such error bars make 
it impossible to use our measurements to distinguish be- 
tween different models. 

In short, we conclude that the use of angular corre- 
lations is crucially necessary, not for the purposes of a 
mere detection of something above the noise and fore- 
ground model, but to further reduce error bars to the 
point where interesting constraints can be placed on the- 
oretical models. 



3. Excluding heavily contaminated regions from 
observations 

While we argued in the previous section that making 
use of angular correlations is crucial for bringing down 
residual foregrounds in one's error bars, in this section 
we briefiy consider an alternative: selectively discarding 
heavily contaminated foregrounds from one's analysis en- 
tirely (or equivalently, simply not observing the dirtiest 



portions of the sky, such as the Galactic plane). By ex- 
cluding the dirtiest foreground pixels one can reduce the 
effective foreground temperature below (1/T)~^, bring- 
ing one closer to the minimum foreground temperature 
in the sky, plotted as the T^in curve in Figure [8] In this 
section, we examine whether the exclusion of dirty sky 
regions does indeed produce smaller error bars in one's 
final measurement. 

We begin by considering just the £ = term in our 



expression for the information matrix [Equation (62)]. 
Again, this is equivalent to omitting angular correlation 
information from our analysij^ Discarding all but the 
first term in the sum over we can simplify our ex- 
pression for the information matrix using the spectral 
coherence matrix Q: 
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where we have used the fact that Qq,^ = 
X]^(vry)Q;(v^)^A^, and have made the substitution 
47r^/sky^sky (that this is an appropriate step can be 
verified by a more detailed derivation of X)"^ that as- 
sumes incomplete sky coverage from the beginning). We 
have also defined as a diagonal matrix containing 
the (1/T) values, i.e. 
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(68) 



From all this, we can easily compute the error bars at a 

1 /2 

specific frequency, which are given by T,Ja • Expressing 
this in terms of our definition of the effective foreground 
temperature [Equation (64)], we have 
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(69) 



where we have taken advantage of the fact that Qaa — 1- 
This equation reveals that our measurement error con- 
sists of two terms added in quadrature. The first is from 
residual foregrounds, and scales as 1/ ^^/sky, since us- 
ing more pixels allows us to average down the error in 
our foreground model. The second is from instrumental 
noise. It depends on the bandwidth lS.v and the total 
integration time tint^ but not on the number of pixels 



For Figures [ll] and [21] on/?/, we make the approximation that the 
angular cross-power spectrum between frequency channels ol and 
/3, C]^'^^, is a separable function of the two frequencies. There 
is no a priori reason to expect this to be the case, but we find 
that it is an excellent approximation. We make use of this ap- 
proximation for purposes of numerical stability. In any case, it is 
a conservative assumption, for a non-separable cross-correlation 
(which might occur, for instance, if different foreground sources 
dominate at different frequencies) only provides more informa- 
tion with which to subtract foregrounds. 



In this section we are considering a scenario where angular cor- 
relations are not used. One could imagine alternatively a situ- 
ation where we not only discard (or do not observe) the most 
heavily contaminated regions, but in addition also take advan- 
tage of angular correlations between the remaining pixels. For 
the purposes of this paper, we do not analyze this scenario, for 
the necessary correlation models that this would entail involves 
additional experiment-specific details such as an experiment's 
scanning strategy. 
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FIG. 12. Expected error bars on a global signal measurement 
at 79 MHz, as a function of the fraction of the sky observed. 
The observations are assumed to be of the cleanest parts of 
the sky (e.g. a fraction of 0.3 means observing the cleanest 
30% of pixels, even if they are not contiguous) and are sim- 
ulated at a resolution of 5 degrees. Different curves denote 
different values of the fiducial foreground model error param- 
eter £. At any realistic level of foreground modeling error, 
it is advantageous to cover as much of the sky as possible to 
reduce the modeling error, even if it means dealing with a 
higher effective foreground temperature. 



/sky. This is because our goal is to measure a cosmo- 
logical monopole, which means the instrumental noise is 
averaged down at the same rate regardless of whether all 
the time is spent integrating on a small patch of the sky 
or the observation time is spread over the entire sky. The 
entire expression is modulated by the effective foreground 
temperature. 



Armed with Equation ( 69 ) , we can answer the question 



of whether it is wise to spend our integration time observ- 
ing only the cleanest regions in the sky. Suppose we took 
the pixels in our sky and sorted them from least fore- 
ground contaminated to most foreground contaminated. 
One can then imagine (excluding practical logistics such 
as scanning strategy) observing in only the /sky cleanest 
portions of the sky. The effective foreground temperature 
Teff is an increasing function of /sky, since it increases 
from Tmin to sky- Thus, if the measurement er- 

rors were determined by Teff alone, it would be best to 
spend all our integration time on the single cleanest pixel 
of the sky. However, we can see in Equation (69) that 



there is a competing influence: the y ^ ^q^^ — ^ f. ^az^ 

a decreasing function of /sky, because sampling more of 
the foreground sky allows one to average down the errors 
in our foreground model. 

In Figure [12] we examine this trade-off, where we show 
the expected error bars on a global signal measurement 
at 79 MHz, as a function of the fraction of the sky ob- 
served. The observations are assumed to include only 
the cleanest parts of the sky. For instance, a coverage 
fraction of 0.3 means the observations include only the 



cleanest 30% of pixels. From the plot, one can see that 
for any reasonable level of fiducial foreground modeling 
error Sq, the errors are minimized by including as much 
of the sky as possible, even if that means sampling some 
highly contaminated foreground regions. In words, this is 
a statement that the foreground modeling errors average 
down more quickly than the foreground amplitudes rise 
as we go to dirtier parts of the galaxy. Mathematically, 
the instrumental noise term turns out to be negligible 
compared to the residual foreground error term. The 
second piece of Equation (69) therefore ends up scaling 
as 1/ Y^/sky, which happens to decay more quickly than 
Teff grows with /sky. The net result is that overall mea- 
surement error is reduced by covering as much of the sky 
as possible, although in practice the gains appear to be 
minimal beyond a 30% coverage of the sky. The trend 
of decreasing error with increasing sky coverage changes 
only if the foreground modeling error is unphysically low 
(e.g. the £0 = curve in Figure 12). Only then is it 



advantageous to avoid observing the dirtier parts of the 
skf^ 



B. The role of spectral information 

In the previous section, we examined the extent to 
which angular information can reduce the error bars on a 
global signal measurement. In this section, we highlight 
the crucial role that spectral information plays. 



Recall from Figure 10 that even an experiment with 
a FWHM instrumental beam that is as wide as 90° can 
make a statistically significant (~ 14<j) detection of a 
signal above residual noise and foregrounds. Since a 90° 
beam is essentially equivalent to having no angular infor- 
mation at all, it follows that the statistical significance 
of such a detection must come from spectral information. 
To see this clearly, let us express our measurement covari- 
ance and the cosmological signal in a basis where the co- 
variance is diagonal. Intuitively, this can be thought of as 
a "residual noise and foregrounds eigenbasis" . Such a ba- 
sis is convenient because without off-diagonal elements, 
the error bars (given by the square root of the diagonal 
elements of the covariance) can be directly compared to 
the signal to arrive at a signal-to- noise ratio. 

In the bottom panel of Figure [13) we show the absolute 
amplitude of the theoretical signal in our residual noise 
and foreground basis for an instrument with a 90° beam. 



15 While Figure [12] was produced using simulations at a 5° resolu- 
tion, we expect our conclusion — that one should cover as much 
of the sky as possible, even if it means observing regions that 
are highly contaminated by foregrounds — to hold at other rele- 
vant resolutions. Finer resolutions are unlikely to be achievable 
at such low frequencies, and at coarser resolutions the act of 
discarding pixels does not reduce the effective foreground tem- 
perature by very much (as one can see from Figure |8|. The 
benefits from going to higher sky coverage fraction are thus even 
more pronounced. 
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Also on the plot is the square root of the (now diago- 
nal) covariance, i.e. the error bars on our measurement 
in this basis. Still focusing on just the bottom panel for 
now, we note several features. As expected, the error 
bars show an exponential decay over the first few modes, 
corresponding to an error budget that is dominated by 
residual foregrounds. In such a regime, the cosmologi- 
cal signal is completely sub-dominant. As one goes to 
higher modes, the errors are determined by a more bal- 
anced mixture and noise and foregrounds, and the decay 
is less rapid. From this figure, we have a simple expla- 
nation for why it is possible to have a high detection 
significance in spite of large error bars in our frequency 
spectrum — plotting our measurement as a function of fre- 
quency represents a bad choice of basis, with foregrounds 
dominating every frequency channel. With a better basis 
choice, it is clear that there are a small number of modes 
(eigenmodes ~ 25 to ~ 50) that can be measured with 
signal-to-noise ratio greater than unity, and it is these 
modes that are providing all the detection significance. 
However, our inability to measure any of the other modes 
limits our ability to accurately constrain the shape of the 
global signal spectrum. 

The difficulty in constraining the shape of the spec- 
trum can be further understood by examining the eigen- 
vectors corresponding to our residual noise and fore- 
ground eigenmodes. In Figure [l4]we show the first few 
eigenmodes, which from Figure]l3| we know are essen- 
tially unmeasurable. These eigenmodes are all spectrally 
smooth, and thus any such patterns in the measured 
spectrum will be lost in the measurement. In the lan- 
guage of our residual noise and foreground basis, these 
modes are very poorly constrained, and so our experi- 
ment will be unable to tell the difference between two 
cosmological scenarios that differ only in these unmea- 
surable modes. Given that the cosmological signal is ex- 
pected to be reasonably smooth spectrally, being unable 
to measure these smooth modes is problematic, for it 
means that there will be a wide class of reasonable cos- 
mological models that our experiment will not be able 
to tell apart. Mathematically, this is why we found that 
the error bars on the final measured spectrum were large 
for experiments with no angular resolution. Such exper- 
iments rely too heavily on spectral information for their 
foreground subtraction, and since both the foregrounds 
and global cosmological signal are rather smooth, a sharp 
constraint on the signal is very difficult. 

In Figure [Tsj we show several of the eigenmodes that in 
contrast can be reasonably well-measured. Immediately 
striking is the fact that these eigenmodes are all localized 
in the trough in the cosmological signal between 50 and 
80 MHz. It is therefore information from the trough that 
is providing most of the detection significance. Next, 
we notice that many of the eigenmodes have a double- 
peaked structure, with one peak positive and one nega- 
tive. With such a structure, each eigenmode is probing 
differences between neighboring frequencies. Looking at 
differences rather than raw values has the advantage of 
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FIG. 13. Absolute value of the theoretical cosmological signal 
(blue circles) and predicted error bars (purple squares) , shown 
in a residual foreground and noise eigenmode basis for an ex- 
periment with a FWHM instrumental beam of 5° (top panel) 
and a FWHM of 90° (bottom panel). In the 90° case, high 
signal-to-noise detections of the cosmological signal are pos- 
sible in a small handful of modes, and it is these modes that 
provide all the detection significance. However, the inability 
to constrain any other modes results in large error bars in the 
final measured global signal spectrum. With a 5° beam, our 
angular foreground subtraction methods allow more spectral 
modes to be measured at high signal-to-noise, contributing 
to a more faithful reconstruction of the shape of the global 
spectrum, with smaller error bars. Note that the cosmological 
signal in the two panels of this figure look slightly different be- 
cause the measurement covariances change when one changes 
the beam size, which in turn changes our eigenbasis. 



being more immune to foreground contamination, since 
the foregrounds are monotonically decreasing functions 
of frequency, so differencing neighboring frequency chan- 
nels will cancel out some of the foregrounds. An alter- 
native, but equivalent way to understand this heuristic 
argument is to think of these "difference modes" as finite- 
difference versions of derivatives. Measuring these modes 
is thus equivalent to measuring the derivative of the spec- 
trum. The algorithm is simply taking advantage of the 
fact that even though the foreground spectrum may be 
larger than the cosmological signal in amplitude, the re- 
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FIG. 14. First few (foreground residual dominated) eigen- 
modes of the measurement covariance XI for an experiment 
with an instrumental beam of FWHM 90°. 
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FIG. 15. The 30'^, 35^^, 40'^, 45^^, and 50^^ eigenmodes of 
the measurement covariance S, all of which are modes that 
can be measured with signal-to-noise ratio greater than one. 
The computations were performed for an experiment with a 
FWHM instrumental beam of 90°. Shown in dotted black is 
our fiducial cosmological signal, arbitrarily rescaled in ampli- 
tude. 



high signal-to-noise eigenmodes are all concentrated in 
the trough of the cosmological signal, perhaps one can 
simply give up on information from the low signal-to- 
noise modes, and limit oneself to constraining proper- 
ties of the trough. Unfortunately, the trough itself is 
quite smooth, and thus much of its amplitude comes 
from the unmeasurable low signal-to-noise modes. To see 
this quantitatively, we can use th e Wiener filtering tech- 
nique described in Section III C , which (as we discussed 



above) is designed to automatically exclude poorly mea- 
sured modes by appropriately weighting the data. In the 
bottom panel of Figure [T6j we show the expected cosmo- 
logical signal, as well as the same signal after it has been 
Wiener-filtered appropriately for an instrument with a 
90° FWHM beam. The filtered result represents the best 
we can possibly do after downweighting the heavily con- 
taminated modes. We see from this that in our attempt 
to eliminate heavily contaminated modes, we have also 
inadvertently destroyed a good portion of the cosmolog- 
ical signal. 

By now, it should be clear that in the presence of fore- 
grounds, angular information is necessary for suppressing 
error bars to an acceptable level. We now turn to ana- 
lyzing an experiment with angular information. In the 



top panel of Figure [13] we show the signal and the er- 
rors in a residual noise and foreground eigenbasis for an 
instrument with a FWHM beam of 5°. Comparing this 
the bottom panel, we see that many more modes can be 
measured at high signal-to- noise, including some of the 
smoothest (lowest eigenmode number) modes. We thus 
expect to be able to be able to mitigate foreground con- 
tamination without destroying as many features in the 
cosmological power spectrum, and indeed we can see in 
the top panel of Figure [16] that the Wiener- filtered cos- 
mological signal more accurately reflects the shape of the 
fiducial spectrum. 



verse may be true of the derivativej^ Of course, here 
the modes are only approximately derivative-like, since 



it is clear from Figure 15 that there is an asymmetry 
in height and width between the positive and negative 
peaks. This is an indication that the algorithm is doing 
a little more than just taking the derivative, but in any 
case, all its behaviors can be captured by calculating the 
eigenvectors as we have done. 



Figure 15 may seem to suggest a strategy for extract- 
ing useful cosmological information out of an experiment 
that has insufficient angular resolution to produce small 
error bars in a measured spectrum. Since the (relatively) 



The fact that measurements seem to be quite sensitive to the 
derivative of the global signal is intriguing, for it has been sug- 
gested [51] that the derivative could be a way to distinguish be- 
tween the X-ray heating from dark matter annihilations and that 
from more conventional astrophysical sources. 



1. Summary: the role of spectral information 

It is important to stress that in discussing the role of 
spectral information in this section, the result from Sec- 
tion nil Al remains true: aside from a direct subtraction 
of a foreground model spectrum, purely spectral methods 
are formally unable to reduce errors from residual fore- 
grounds. Without angular information to help with fore- 
ground subtraction, one must simply hope that the fore- 
ground spectra look sufficiently different from the cos- 
mological signal that interesting constraints on theoret- 
ical models can be placed without using modes that are 
known to be heavily contaminated. Unfortunately, these 
modes also contain a significant fraction of the cosmo- 
logical signal, and in bottom panel of Figure [16] we saw 
that this meant that many of the interesting features of 
the cosmological signal would be washed out along with 
the foregrounds. This also manifested itself in Figure [TTJ 
where we saw that with no angular resolution, the error 
bars on the final measurement ended up being unaccept- 
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FIG. 16. Comparisons between the true global signal and 
ones that have been Wiener- filtered. The top panel is for an 
experiment with an instrumental beam of of 5° width, while 
the bottom panel is for one with a 90° beam. The Wiener 
filter eliminates modes that cannot be measured without an 
extremely accurate foreground model. The filtered curves in 
this figure thus encapsulate the best spectra that one can ex- 
pect to measure, given some of the unavoidable similarities 
between the signal and foregrounds, as viewed by an exper- 
iment with different beam widths. With a 90° resolution, 
many of the measured spectral modes are heavily contami- 
nated, and in eliminating them from our final result, we have 
also washed out a non-negligible fraction of the cosmologi- 
cal signal. In comparison, foregrounds can be cleaned more 
aggressively using angular information from a 5° resolution 
experiment, and fewer features in the cosmological signal are 
washed out. 



ably high. In order to get small error bars (and sub- 
sequently put constraints of theoretical scenarios), one 
must use angular information to aid with foreground sub- 
traction. 



C. Instrumental noise and integration time 

In this section, we consider the effects of varying the in- 
tegration time. Increasing the integration time decreases 
the instrumental noise in our measurement, and we now 
examine exactly how this affects our detection signifi- 
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FIG. 17. A plot of 7 = Vxi5]~ixs, the "number of sigmas" 
with which our fiducial cosmological signal can be detected, 
as a function of integration time. The detection significance 
is seen to rise with greater integration, but because the errors 
involve residual foregrounds in addition to instrumental noise, 
the rise is less rapid than what would be expected from a 
simple scaling. A fiducial foreground model error of so — 
0.1 and an instrumental beam with FWHM of 5° was assumed 
to make this plot. 



cance and error bars. 

In Figure [T7| we show the "number of sigmas" 7 with 
which the cosmological signal can be detected, as a func- 
tion of time. The plot is for an experiment with a 5° 
instrumental beam, but apart from a simple rescaling of 
the amplitude, the plot is essentially identical for wider 
beams. As expected, the detection significance increases 
with extra integration time, although not as Vt, as would 
be the case if our experiment were limited only by in- 
strumental noise. A power law fit to the curve reveals 
that 7 scales roughly as t^'^, suggesting that residual 
foregrounds cannot be perfectly sequestered into a few 
eigenmodes, and affect even those high signal-to-noise 
spectral modes that are responsible for giving us most of 
our detection significance. 

Turning to the error bars, we find that the behav- 
ior is different for experiments with angular sensitivity 
compared to those without. In Figure 18 we show the 
measurement error at 79 MHz as a function of time, nor- 



malized to the error of each experiment after 50 hours of 
integration. Performing fits to the curves reveals a 
scaling for the 5° case and a scaling for the 90° 

case. Thus, in neither case do we have the scaling 
that one would expect from pure instrumental noise, but 
the errors are integrated down more rapidly when there 
is angular information. Intuitively, integrating for longer 
without angular information allows us to reduce our er- 
rors in the high signal-to-noise eigenmodes that we can 
already access, but does not provide us with the ability 
to measure new spectral modes that were limiting our 
ability to constrain the shape of the global signal. The 
situation is different when there is angular information, 
because in that case the foreground residuals are suffi- 
ciently well controlled for instrumental noise reduction 
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FIG. 18. Errors at 79 MHz as a function of integration time, 
normalized to the error after 50 hrs of integration. The error 
bars are integrated down more rapidly for the experiment 
with the 5° beam than the one with the 90° beam, but both 
scalings are slower than t~^/^, which would be the case if the 
measurements were instrumental noise dominated. A fiducial 
foreground model error of So = 0.1 was assumed to make this 
plot. 



FIG. 19. A plot of the error at 79 MHz in the extracted global 
21 cm signal, shown as a function of the fractional error in our 
foreground model sq. At unrealistically low so, instrumental 
noise is the dominant source of error, and the final errors do 
not vary with sq. As we increase the foreground model errors, 
residual foregrounds dominate and the errors increase with eo . 
An integration time of 100 hrs was assumed to make this plot. 



to produce an appreciable improvement in the error bud- 
get. 

In summary, then, we find that extra integration helps 
to increase the detection significance whether there is an- 
gular information or not. However, the decrease in the er- 
ror bars is more pronounced when the angular resolution 
is fine. This is because in such an experiment, the spec- 
tral eigenmodes that are the most useful for constraining 
the shape of the global signal are not completely residual 
foreground dominated, and so a reduction in instrumen- 
tal noise has a more significant effect on the errors. 



D. Reference foreground survey uncertainty 

To further our understanding of how the residual fore- 
grounds affect our measurement, we now consider the ef- 
fects of varying Sq, the fractional error in our foreground 
model. In Figure[T9) we plot, as a function of Sq, the error 
bars on the measured global signal at 79 MHz. At unreal- 
istically low foreground modeling error, the observations 
are dominated not by residual foregrounds, but instead 
by instrumental noise. The errors are therefore roughly 
independent of Eq. Once 6:o^fg/\/^sky ^ the 
errors become increasingly dominated by residual fore- 
grounds, and therefore rise with sq. Note that the rise 
occurs much more rapidly when the angular information 
is poor. This can be understood through the lens of our 



result from Section HI A which stated that with only 
spectral information, nothing can be formally done to 
mitigate errors in the foreground model. As Sq rises, 
larger errors must simply be accepted by those experi- 
ments without angular resolution. 

The transition from a noise-dominated experiment to 
a foreground-residual dominated experiment can also be 



seen in Figure |20j where we show the detection signifi- 
cance 7 as a function of Sq. There are several regimes of 
interest. At very low Sq, the foreground contribution to 
the covariance is negligible, and the information matrix 
[Equation (58)] is diagonal to a good approximation. Its 
eigenvectors(our residual noise and foreground basis vec- 
tors) are thus delta functions in frequency, and every fre- 
quency band can be measured easily. As eo increases be- 
yond ~ 10~^ {i.e. beyond the noise dominated regime), 
the detection significance drops, and an examination of 
the eigenmodes reveals the double-peaked structure seen 



in Figure 15 Intuitively, the residual foregrounds are be- 
coming more of a contaminating influence, and precau- 
tions (such as differencing neighboring frequency chan- 
nels, which is what the double-peaks do) need to be 
taken. As one approaches eo ~ 0.05 and beyond, the 
detection significance drops more sharply because one 
ceases to be able to make statistically significant mea- 
surements beyond the 50 to 80 MHz trough. 

As with any experiment that is plagued by foreground 
contamination, the better one can model the foregrounds. 



the better the final results. However, Figure 20 reveals an 
interesting result — unless fractional errors in foreground 
models can be suppressed beyond the ^ 10~^ level to the 
~ 10~^ level, the improvements in detection significance 
are not dramatic. 



V. A DESIGNER'S GUIDE TO EXPERIMENTS 
THAT PROBE REIONIZATION 

We now turn to examining experiments that target the 
reionization regime, from 100 to 250 MHz. The analysis 
formalism remains essentially the same as that in the pre- 
vious section, and most results remain unchanged. We 
therefore focus on highlighting the differences. We will 
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FIG. 20. The detection significance 7 of the fiducial global 
21 cm signal, plotted against the fractional error in our fore- 
ground model So. At extremely low £0, foreground residuals 
are a negligible source of error and the detection significance 
is constant. At higher so, foregrounds become more impor- 
tant, and 7 drops until the detection is driven by the trough 
feature between 30 and 80 MHz. At the highest £0, 7 drops 
further as foreground modeling errors become so high that 
even the trough is difficult to detect. An integration time of 
100 hrs was assumed to make this plot. 
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FIG. 21. Same as Figure [TT] showing error bars as a func- 
tion of the highest i mode used, but for ly — 198 MHz (more 
relevant for reionzation) . 



find that whereas a significant detection (but not a well- 
constrained measurement) could be achieved without an- 
gular resolution for the Dark Ages, for reionization there 
are scenarios where a positive detection simply cannot be 
made without angular information. 



Consider first the predicted error bars. In Figure 21 



we 

show the projected errors during reionization as a func- 
tion of the maximum ^ mode used in the analysis, just as 
we did for the Dark Ages in Figure [TT] Like before, the 
errors are seen to be relatively large if angular correla- 
tions are not used in the analysis (for comparison, recall 
that the signal is now ~ 30 mK and gradually decaying 
to zero). With angular correlations, the error bars can 
be suppressed to a level that allows different theoretical 
models to be distinguished from each other, exactly as 
we saw for the Dark Ages. 



FIG. 22. Some sample histories of the brightness temperature 
Th during reionization. Our samples here all have a midpoint 
of reionization Zr of 10. Different curves show different reion- 
ization durations. 



However, when we consider the statistical significance 
of our detection, the story departs from what we found 
previously. In particular, we will find here that the ability 
to make a statistically significant detection of a signal 
above noise and foregrounds is dependent on the duration 
of reionization A2:. This parameter enters into a general 
form of the reionization global signal that we adopt: 



T21 
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1 



10 



1/2 



tanh 



(70) 



where T21 = 27 mK and Zr is the redshift at the midpoint 
of reionization [21]. Comparing this to Equation ([T]), we 
see that this arises from assuming that the mean ionized 
fraction takes the form 



Xi{z) 



1 + tanh 



(71) 



This parameterization is identical to the one adopted by 
[21_ and [22 , and differs from that employed by WMAP 
j39] and the CAMB software package [52] only in that 
their tanh parameterization is in conformal time rather 
than redshift. Some sample brightness temperature sig- 
nals with differing Az but fixed Zr (equal to 10) are shown 
in Figure [22] 

Intuitively, one would expect reionization histories 
with larger Az to be more difficult to detect, since they 
give rise to cosmological signals that monotonically de- 
crease in smooth, gradual ways with frequency that are 
very similar to typical foreground spectra such as that 
shown in Figure |6] This is exactly what one sees in Fig- 
' ' As reionization becomes more and more abrupt. 
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ure 

its signature in the global signal becomes increasingly dis- 
tinct from that of the smooth foregrounds, and becomes 
easier to detect. For abrupt reionization, using higher I 
modes in the analysis improves the error bars, but does 
not increase the detection significance, just as we found 
for the Dark Ages. As we move to more gradual reion- 
ization scenarios, however, we see that the foregrounds 
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In Figure 25 we show the expected measurement er- 
rors, as weh as our fiducial cosmological model. With 
the error suppressed below the cosmological signal in the 
50 and 80 MHz trough region, it is likely that detailed 
constraints can be placed. (The errors can be further 
suppressed by binning the data points shown in the fig- 
ure, although there do exist non-negligible correlations 
between the errors, so the improvement does not quite 
scale as I/a/TV, where N is the number of data points). 



The results shown in Figure 25 represent a 25<j detection 
of our fiducial cosmological signal. 



FIG. 23. Detection significance 7 for various reionization du- 
rations Az (longer dashes correspond to greater Az), plotted 
as a function of various i cutoffs, beyond which the angular 
correlation information is not used for foreground mitigation. 
Whereas abrupt reionization scenarios can be easily distin- 
guished from foregrounds and to yield high-significance de- 
tections, for gradual reionization scenarios it is necessary to 
utilize angular correlation information to make a detection. 



become so difficult to distinguish from the cosmological 
signal that it becomes necessary to use angular correla- 
tions simply to obtain a detection. 

The necessity of angular information for probing ex- 
tended reionization histories is further demonstrated in 
where we show the contribution to 7^ as a 
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Figure 

function of £ and foreground eigenmode 7^, for an exper- 
iment with a FWHM beam of 5°. The assumed reion- 
ization scenario has a midpoint Zr = 10 and duration 
Az = 4. Unlike for the Dark Ages, where ffne angular 
correlations certainly helped — but did not dominate — 
the statistical signiffcance of our detection, we see that 
for extended reionization most of our ability to detect 
a signal above noise and foregrounds comes from high £ 
information. Angular information is thus crucial. 



VI. 



EXPECTED PERFORMANCE OF 
FIDUCIAL EXPERIMENTS 



In Sections IV andrVl we considered the various trade- 



offs in the experimental design of global 21 cm signal ex- 
periments. Having gained an intuition for the best types 
of experiments to build, we now consider some fiducial 
experiments and analyze their expected performance. 



A. A fiducial dark ages experiment 

As our fiducial dark ages experiment, we imagine an in- 
strument with a FWHM beam of 5° and a channel width 
of Au = 1 MHz. We assume a total integration time of 
100 hrs over the entire sky. For our foreground model, we 
assume that current constraints are accurate to the 10% 
level at an angular resolution of 5°. 



B. A fiducial reionization experiment 

As our fiducial reionization experiment, we also imag- 
ine an instrument with FWHM beam of 5°. We assume 
a channel width of Au — 2 MHz and an integration time 
of 500 hrs. Unlike the Dark Ages where the trough signa- 
ture aided one's detection, for reionization we find that 
it is necessary to integrate for longer to be able to detect 
extended reionization scenarios. 



In Figure 26 we show the measurement errors along 
with several reionization scenarios. The errors are seen 
to be well below the signal in all cases, suggesting that 
it should be possible detect a wide variety of reioniza- 
tion scenarios. This is further illustrated in Figure [27| 
where we show 7 [recall from Equation (65) that this is 
the "number of sigmas" of the detectionfas a function 
of the reionization midpoint Zr and duration Az of our 
fiducial model from Equation ([70|. As expected, sharper 
reionization histories (small Az) are more easily distin- 
guishable from smooth foregrounds, and thus are more 
easily detectable. Above Az ~ 2, however, we see that 
this no longer applies, and one must simply hope that 
reionization took place at lower redshifts, where the over- 
all amplitude of foregrounds is lower. In any case, we see 
from the figure that a positive (3cr or above) detection can 
be made over a wide region of parameter space, including 
rather extended reionization scenarios at high redshift. 

We emphasize that angular information is the key to 
making positive detections of the global 21 cm signal even 
when reionization is extended. To see this, consider Fig- 
ure 28 , where we show the detection significance contours 
for an instrument with a 90° beam. With very little 
angular information available, the detection significance 
7 goes down throughout parameter space. In addition, 
the rather horizontal orientation of the contours suggests 
that without angular information, 7 is driven primar- 
ily by the duration of reionization, with abrupt reioniza- 
tion scenarios more easily distinguishable over foreground 
spectra. Only with angular information is it possible to 
detect extended reionization scenarios. 

As a further demonstration of this, suppose one were to 
fit the data from an experiment to Equation (70), with 
T21, Zr^ and Az as free parameters. To estimate the 
precision with which the parameters can be constrained, 
we use the Fisher matrix formalism. Starting with our 
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FIG. 24. Same as Figure [9] except for an experiment probing an extended reionization scenario with Zr — 10 and lS.z — 4. 
Whereas most of the detection significance comes from the ^ = information in experiments that target the Dark Ages, an 
extended reionization scenario has a spectrum that is so similar to a typical foreground spectrum that most of the detection 
significance comes from using high i modes to aid foreground mitigation. 
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FIG. 25. Measurement errors and the absolute value of the 
fiducial cosmological model. The fiducial experiment has a 
channel width of 1 MHz, a FWHM beam of 5°, and covers 
the full sky over a 100 hr integration. The trough between 
50 and 80 MHz can clearly be measured accurately, allowing 
different cosmological scenarios to be distinguished from each 
other. These results have not been Wiener filtered, since the 
errors are small thanks to the use of angular correlations. 
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FIG. 26. Measurement errors and several fiducial reionization 
scenarios. The fiducial experiment has a channel width of 
2 MHz, a FWHM beam of 5°, and covers the full sky over a 
500 hr integration. In all cases the measurement error is far 
below the signal, ensuring a positive detection. The results 
have not been Wiener filtered, since this is unnecessary with 
small errors. 



251 



10 



11 



12 



Zr 



FIG. 27. Detection significance 7 [Equation (65)] as a func- 
tion of the midpoint of reionization Zr and the duration of 
reionization Az, plotted with a logarithmic color scale for an 
instrument with a 5° beam. The boxed numbers show 7 of 
each contour. A positive detection is possible over a wide 
range of reionization scenarios. 



equation for the Fisher matrix [Equation (38)] we sim- 
ply select T21, Zj.^ and as our parameters and use our 
measurement covariance 5] in place of the noise covari- 
ance C. The inverse of the Fisher matrix then provides 
us with the best possible covariance on our three param- 
eters. Performing the calculation for our fiducial exper- 
iment assuming a reionization model with T5 = 27 mK, 
Zr = 11 and Az = 3 yields error bars of 8.9 mK, 0.67, 
and 0.51 for those parameters respectively. Pairwise pro- 
jections of the error ellipsoid (obtained by marginalizing 
over the unplotted third variable) are shown in Figure 
I29I While in this case of rather extended reionization 
the detection is perhaps not as statistically significant as 
one might hope, it is at least possible, which is not the 
case if only spectral methods are employed. In addition, 
extra integration time results in tighter constraints, since 
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FIG. 28. Same as Figure [271 but for an instrument with a 90° 
beam (i.e. one with very httle angular information). The de- 
tection significance clearly decreases compared to Figure [27l 
where angular information was used to improve foreground 
subtraction. The horizontal orientation of the contours in 
this figure indicate that foreground separation in spectral- 
only experiments rely primarily on reionization happening in 
an abrupt fashion. 



the angular foreground subtraction techniques allow one 
to access some of the smooth spectral modes. 



C. Comparisons to cosmic microwave background 
constraints 

Constraints on reionization can also be derived from 
cosmic microwave background (CMB) experiments, and 
in this section we compare current and upcoming CMB 
constraints to those that can be expected from global 
signal measurements. As CMB photons travel from the 
surface of last scattering to us, they are Thomson scat- 
tered by free electrons from reionization, giving rise to an 
optical depth r. In particular, the optical depth is given 
by 



= (JT I 

Jo 



ne{z) ds 
1 -\- z dz 



dz, 



(72) 



where ar is the Thomson cross-section, z^ec is the recom- 
bination redshift, ne{z) is the mean free electron num- 
ber density, and ds is the comoving line element. The 
reionization history is encoded in the functional form of 
rieiz)^ which is proportional to the mean ionization frac- 
tion Xi{z): 



where is the mean baryon density, /ie = 1-22 is the 
mean mass per free electron in atomic units, and nip 
is the proton mass. Equations (72) and ( ffsl) re late the 
reionization model [in our case. Equation (|71|)] to the 
optical depth. A measurement of the optical depth from 
the temperature and polarization power spectra of the 
CMB will thus constrain reionization. 

Unfortunately, optical depth measurements have a ma- 
jor shortcoming: as integrated line-of-sight measure- 
ments, their dependence on the duration of reionization 
Az is extremely weak, since all that matters is the col- 
umn density of free electrons to the last scattering sur- 
face. Optical depth constraints are therefore primarily 
constraints on z^. This is certainly true for WMAP, and 
[53] finds the same to be true for Planck. The latest 
WMAP fits find Zr = 10.5 ± 1.2 [39 . 

More recently, the South Pole Telescope (SPT) has 
used limits on the kinetic Sunyaev-Zel'dovich (kSZ) effect 
to place constraints on the duration of reionization. The 
kSZ effect refers to the Doppler shifting of CMB photons 
as they scatter off coherently streaming free electrons, 
and contributes to the CMB anisotropy because reioniza- 
tion is not spatially homogeneous {i.e. it is "patchy"). 
By combining reionization simulations with SPT mea- 
surements of the high £ portions of the CMB power spec- 
trum, a tentative 2a constraint of (A2:)^^^ < 7.9 was 
obtained [54 , where the SPT team defined (A2:)^^^ to 
mean the difference in redshift between Xi = 0.2 and 
0.99. // one assumes the reionization model of Equation 
(71), this translates to an upper limit of Az < 2.64 with 
our parameterizatiorj^ We use this SPT constraint for 
illustration purposes, but note that it is non-trivial to 
convert a measurement of the kSZ contribution to the 
CMB into a reionization constraint. Allowing for more 
general reionization models may weaken the constraint 
on Az 55 . 

we show the constraints set by the 



In Figure 30 



WMAP optical depth measurement (in purple), along 
with the SPT kSZ constraint (in orange), both at 2a. 
From observations of the Gunn-Peterson trough, we also 
know that reionization is complete hy z = 6.5 [56 , so we 
include a prior that Xi{z = 6.5) > 0.95 (in red). The 
Planck satellite is expected to improve on the WMAP 
measurements of r, potentially capable of achieving a 
precision of Ar = 0.005 [57 . This translates to a tighter 
set of limits on z^, which we also show in Figure 30 as a 
set of dashed lines. 

Finally, the EDGES global signal experiment has ruled 
out very rapid {Az < 0.06) reionization [22 . Being a 
single-dipole experiment, EDGES does not have suffi- 
cient angular resolution to provide constraints on more 
extended reionization scenarios, as we discussed in Sec- 



Xi{z)pb{z) 



(73) 



This is a smaller number than the SPT figure because z = Zr — 
/S.z/2 and z = Zr -\- /S.z/2 correspond to ionization fractions of 
Xi = 0.269 and 0.731 respectively, giving a narrower time period 
than the one the SPT team used in their parameterization. 
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FIG. 29. Pairwise projections of the likelihood contours (la and 2a) for Az, Zr, and T21 from Equation (70), assuming fiducial 
values of Az = 3, = 11, and T21 = 27 mK. In each case, the projections were obtained by marginalizing over the unplotted 
third variable. Experimentally, an angular resolution of 5° and an integration time of 500 hrs is assumed. 
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FIG. 30. A comparison between CMB constraints on reioniza- 
tion and the projected performance of our fiducial global sig- 
nal experiment, which assumes an angular resolution of 5° and 
an integration time of 500 hrs. Projected Planck constraints 
are given by the dashed lines. Existing WMAP constraints 
[39 are given by the dark purple bands. SPT measurements 
of the kSZ effect disfavor extremely long duration reionization 
scenarios, whereas EDGES rules out the most rapid scenarios. 
Also included is the constraint (from quasar measurements of 
the Gunn- Peterson trough) that reionization is completed by 
z — 6.5. Assuming a reasonably pessimistic reionization du- 
ration of Az — 2.2, our fiducial reionization experiment gives 
tight la (white) and 2a (yellow) error ellipses. Again, we 
caution that the SPT constraint here is both aggressive and 
model dependent, so may prove weaker when more general 
models of inhomogeneous reionization are considered. 



tions [V| and VI B On the other hand, an experiment 
with a 5° instrumental beam and an integration time of 
500 hrs (i.e. our fiducial reionization experiment from 



Section VIE) can mitigate foregrounds much more ef- 



fectively, leading to the white (la) and yellow (2cr) like- 
lihood contours in Figure [30] In computing these con- 
tours, we assumed Az = 2.2 in order to examine a 
reasonably extended — and therefore quite observation- 
ally pessimistic — reionization scenario that has not yet 
been ruled out by the current SPT observations. En- 
couragingly, we see that even under these conservative 
assumptions, global signal experiments can improve on 
the projected Planck Zr constraints, and in addition pro- 
vide tight limits on Az. This holds true even if the SPT 
limits become less stringent with future analyses, as we 
can see in Figures [28] and [29l In any case, a global signal 
measurement should provide an interesting independent 
cross-check on all the aforementioned probes of reioniza- 
tion. 

It must be noted that the quantitative details in this 
subsection hinge on the specific parametric form of the 
ionization fraction that we assumed [Equation ([7T|]. A 
different parameterization would affect all of the limits 
in Figure [30] except for the WMAP and Planck con- 
straints (since optical depth measurements are insen- 
sitive to ionization history). However, we expect the 
overall message — that a global signal experiment with 
good angular resolution can significantly improve our 
constraints — to be a robust one. 



VII. CONCLUSIONS: LESSONS LEARNED 

In this paper, we have considered the general problem 
of extracting the global 21 cm signal from measurements 
that are buried in foreground and instrumental noise con- 
taminants. We developed a mathematical formalism that 
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extends the spectral-only signal extraction methods ex- 
isting in the literature to incorporate angular methods 
for separating the cosmological signal from foregrounds. 
Crucially, our proposed data analysis algorithms do not 
require a priori information about the form of the cosmo- 
logical signal. This makes our methods immune to pos- 
sible mismatches between theoretical expectations and 
observational reality, complementing other approaches in 
the literature that assume a parametric form for the sig- 
nal. One might imagine using our methods to make a 
first measurement of cosmological signal, which would 
allow a theoretical parameterization to be confirmed or 
revised; then, one could make use of the parameterization 
to precisely determine reionization parameters. 

We also used our formalism in conjunction with a fore- 
ground model that was constructed from a combination 
of empirical data and analytical extensions to explore 
the various tradeoffs that one encounters when designing 
a global 21 cm signal experiment. The following is list of 
"lessons learned" from our exploration: 

• The simplest use of angular information is to down- 
weight heavily contaminated region in the sky. This 
can reduce the effective foreground contamination 
by as much as a factor of 2 (Figure [8|. 

• Focusing on just the cleanest parts of the sky and 
simply excluding the dirtiest pixels from observa- 
tions/analysis has two effects. On one hand, the 
average amplitude of foreground contamination is 
lower. On the other hand, having fewer pixels 
makes it harder to average down the errors in one's 
foreground model, as well as to take advantage 
of angular correlations in the foregrounds to miti- 
gate their influence. Numerically, we find that the 
second effect dominates, which suggests that one 
should cover as much of the sky as possible. 

• Most of the statistical significance in one's detec- 
tion comes from the £ = angular mode, as op- 
posed to higher ^ modes in our whitened {i.e. fore- 
ground model pre-divided) data. However, this sig- 
nificance comes mostly from high signal-to-noise 
measurements of certain spectral modes that alone 
are insufficient to accurately constrain the shape of 
the cosmological spectrum. To get small enough 
error bars to faithfully reconstruct the shape of the 
spectrum, it is necessary to take advantage of angu- 
lar information to clean foreground-contaminated 
modes that would be inaccessible using spectral- 
only methods. 

• The most easily measurable feature during the dark 
ages is (as expected) the trough between 50 and 
80 MHz. 

• Errors integrate down more quickly with time for 
experiments possessing fine angular sensitivity than 
those that do not. In the latter case, the extra in- 
tegration time allows one to achieve even higher 



significance measurements of spectral modes that 
were already measured, without having a substan- 
tial effect on the detectability of the spectral modes 
that were not measurable due to large foreground 
contamination. As mentioned above, these were 
the modes that were previously limiting the accu- 
racy of the extracted global signal, and since their 
measurability was limited by foregrounds and not 
instrumental noise, extra integration time does lit- 
tle to help. For experiments with angular resolu- 
tion, angular foreground mitigation methods allow 
the residual foregrounds in those modes to be suf- 
ficiently suppressed for instrumental noise to be an 
important contribution to the errors, and so extra 
integration time has a larger effect. 

• Reducing the error in foreground modeling reduces 
the final measurement error. However, unless the 
fractional foreground modeling error sq can be re- 
duced significantly beyond 0.01, dramatic gains are 
unlikely. 

Taking advantage of these lessons, we examined some 



fiducial experiments in Section VI Incorporating the var- 
ious recommendations listed above, we showed that small 
errors could be achieved for both the Dark Ages and 
reionization. With reionization in particular, our angu- 
lar methods allow extended reionization scenarios to be 
detected, which is encouraging since there are reasonable 
expectations that the duration of reionization > 2 
[58]. Crucially, we point out that the angular resolu- 
tion and integration time requirements that are needed 
to utilize the methods of this paper are modest compared 
to those of large interferometric arrays such as LOFAR, 
MWA, PAPER, or GMRT. Thus, while it is necessary to 
go beyond a traditional single-dipole experiment if one 
is to make full use of angular information, it should be 
possible to do so using a mid-scale experiment without 
incurring the expense and technical difficulties of a large 
experiment [38 . If time (and autocorrelation informa- 
tion) is available on the large interferometers, they can of 
course be used to attempt a global signal measurement 
(such as that being currently pursued by the LOFAR 
team [59]), but this may be an unnecessarily expensive 
option unless the relevant data has already been taken 
for other purposes. 

Aside from instrumental noise and foreground contam- 
ination, global 21 cm signal experiments have many other 
issues to contend with before becoming an experimental 
reality. For example, instrumental calibration remains a 
challenge |37j, although the accuracy requirements may 
be reduced by the results of this paper, since one no 
longer needs to rely solely on an exquisite spectral mea- 
surement for foreground subtraction. In any case, it is 
encouraging to see that by suitable experimental design 
and optimal data analysis, the potential show-stopper 
of instrumental noise and foreground contamination ap- 
pears to be a controllable problem, paving the way for 
global 21 cm signal experiments to make some of the first 
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direct measurements of a mostly unconstrained era of 
cosmic evolution. 
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